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Preface 


This book consists of two separate, but closely related, parts. 
The first part (Chapters 1-10) is subtitled The Elements of Integra- 
tion] the second part (Chapters 11-17) is subtitled The Elements of 
Lebesgue measure. It is possible to read these two parts in either 
order, with only a bit of repetition. 

The Elements of Integration is essentially a corrected reprint of a 
book with that title, originally published in 1966, designed to present 
the chief results of the Lebesgue theory of integration to a reader hav- 
ing only a modest mathematical background. This book developed 
from my lectures at the University of Illinois, Urbana-Champaign, 
and it was subsequently used there and elsewhere with considerable 
success. Its only prerequisites are a understanding of elementary real 
analysis and the ability to comprehend “e-6 arguments”. We suppose 
that the reader has some familarity with the Riemann integral so that 
it is not necessary to provide motivation and detailed discussion, but 
we do not assume that the reader has a mastery of the subtleties of 
that theory. A solid course in “advanced calculus” , an understanding 
of the first third of my book The Elements of Real Analysis , or of 
most of my book Introduction to Real Analysis with D. R. Sherbert 
provides an adequate background. In preparing this new edition, I 
have seized the opportunity to correct certain errors, but I have re- 
sisted the temptation to insert additional material, since I believe 
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that one of the features of this book that is most appreciated is its 
brevity. 

The Elements of Lebesgue Measure is descended from class notes 
written to acquaint the reader with the theory of Lebesgue measure 
in the space R p . While it is easy to find good treatments of the case 
P = 1, the case p > 1 is not quite as simple and is much less frequently 
discussed. The main ideas of Lebesgue measure are presented in 
detail in Chapters 10-15, although some relatively easy remarks are 
left to the reader as exercises. The final two chapters venture into 
the topic of nonmeasurable sets and round out the subject. 

There are many expositions of the Lebesgue integral from various 
points of view, but I believe that the abstract measure space approach 
used here strikes directly towards the most important results: the 
convergence theorems. Further, this approach is particularly well- 
suited for students of probability and statistics, as well as students 
of analysis. Since the book is intended as an introduction, I do not 
follow all of the avenues that are encountered. However, I take pains 
not to attain brevity by leaving out important details, or assigning 
them to the reader. 

Readers who complete this book are certainly not through, but 
if this book helps to speed them on their way, it has accomplished its 
purpose. In the References, I give some books that I believe readers 
can profitably explore, as well as works cited in the body of the text. 

I am indebted to a number of colleagues, past and present, for 
their comments and suggestions; I particularly wish to mention N. T. 
Hamilton, G. H. Orland, C. W. Mullins, A. L. Peressini, and J. J. 
Uhl, Jr. I also wish to thank Professor Roy O. Davies of Leicester 
University for pointing out a number of errors and possible improve- 
ments. 

Robert G. Bartle 

Ypsilanti and Urbana 
November 20, 1994 
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CHAPTER 1 


Introduction 


The theory of integration has its ancient and honorable roots in the 
“method of exhaustion” that was invented by Eudoxos and greatly 
developed by Archimedes for the purpose of calculating the areas and 
volumes of geometric figures. The later work of Newton and Leibniz 
enabled this method to grow into a systematic tool for such calculations. 

As this theory developed, it has become less concerned with applica- 
tions to geometry and elementary mechanics, for which it is entirely 
adequate, and more concerned with purely analytic questions, for 
which the classical theory of integration is not always sufficient. 
Thus a present-day mathematician is apt to be interested in the con- 
vergence of orthogonal expansions, or in applications to differential 
equations or probability. For him the classical theory of integration 
which culminated in the Riemann integral has been largely replaced 
by the theory which has grown from the pioneering work of Henri 
Lebesgue at the beginning of this century. The reason for this is 
very simple: the powerful convergence theorems associated with the 
Lebesgue theory of integration lead to more general, more complete, 
and more elegant results than the Riemann integral admits. 

Lebesgue’s definition of the integral enlarges the collection of 
functions for which the integral is defined. Although this enlargement 
is useful in itself, its main virtue is that the theorems relating to the 
interchange of the limit and the integral are valid under less stringent 
assumptions than are required for the Riemann integral. Since one 
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frequently needs to make such interchanges, the Lebesgue integral is 
more convenient to deal with than the Riemann integral. To exemplify 
these remarks, let the sequence (/„) of functions be defined for x > 0 
by AW = e~ nx /Vx. It is readily seen that the (improper) Riemann 
integrals 

/* + oo p-nx 

1 


exist and that lim f n (x) = 0 for all x > 0. However, since lim f n {x) 

n~* co x-»0 

= +oo for each n, the convergence of the sequence is certainly not 
uniform for x > 0. Although it is hoped that the reader can supply 
the estimates required to show that lim /„ = 0, we prefer to obtain 
this conclusion as an immediate consequence of the Lebesgue Dominated 
Convergence Theorem which will be proved later. As another example, 
consider the function F defined for / > 0 by the (improper) Riemann 
integral 



x 2 e tx dx. 


With a little effort one can show that F is continuous and that its 
derivative exists and is given by 

F'{t) = - f x 3 e~ tx dx, 

Jo 

which is obtained by differentiating under the integral sign. Once 
again, this inference follows easily from the Lebesgue Dominated 
Convergence Theorem. 

At the risk of oversimplification, we shall try to indicate the crucial 
difference between the Riemann and the Lebesgue definitions of the 
integral. Recall that an interval in the set R of real numbers is a set 
which has one of the following four forms: 

[a, b] -■ {x e R : a ^ x ^ b}, (a, b) = {x 6 R : a < x < b}, 

[a, b) - {x e R : a < x < b} , (a, b] = {x e R : a < x < b). 

In each of these cases we refer to a and b as the endpoints and prescribe 
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b — a as the length of the interval. Recall further that if E is a set, 
then the characteristic function of E is the function xe defined by 

Xe(x) = 1 , if x e E, 

= 0, if x $ E. 

A step function is a function tp which is a finite linear combination of 
characteristic functions of intervals ; thus 

» 

9 = 2 C i Xe > ■ 

J = 1 

If the endpoints of the interval E } are a t , b u we define the integral 
of <p to be 

n 

9 = 2 ~ a t>- 

J-l 

If/ is a bounded function defined on an interval [a, b ] and if/ is not 
too discontinuous, then the Riemann integral of / is defined to be the 
limit (in an appropriate sense) of the integrals of step functions which 
approximate /. In particular, the lower Riemann integral of / may be 
defined to be the supremum of the integrals of all step functions <p 
such that <p(x) ^ f(x) for all x in [a, b], and <p(x) = 0 for x not in [a,b]. 

The Lebesgue integral can be obtained by a similar process, except 
that the collection of step functions is replaced by a larger class of 
functions. In somewhat more detail, the notion of length is generalized 
to a suitable collection X of subsets of R. Once this is done, the step 
functions are replaced by simple functions, which are finite linear 
combinations of characteristic functions of sets belonging to X. If 

n 

9 = 2 C I Xe, 

i 

is such a simple function and if n(E) denotes the “measure” or 
“generalized length” of the set £ in X, we define the integral of y to be 

{■9 = 2 

J i = i 

If/is a nonnegative function defined on R which is suitably restricted, 
we shall define the (Lebesgue) integral of / to be the supremum of the 
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integrals of all simple functions q> such that <p(x) < f(x) for all x in R. 
The integral can then be extended to certain functions that take both 
signs. 

Although the generalization of the notion of length to certain sets in 
R which are not necessarily intervals has great interest, it was observed 
in 1915 by Maurice Frechet that the convergence properties of the 
Lebesgue integral are valid in considerable generality. Indeed, let X 
be any set in which there is a collection X of subsets containing the 
empty set 0 and X and closed under complementation and countable 
unions. Suppose that there is a nonnegative measure function ^ 
defined on X such that /u(0) = 0 and which is countably additive in 
the sense that 

m(u £y) = J KE,) 

for each sequence (E,) of sets in X which are mutually disjoint. In 
this case an integral can be defined for a suitable class of real-valued 
functions on X, and this integral possesses strong convergence 
properties. 

As we have stressed, we are particularly interested in these con- 
vergence theorems. Therefore we wish to advance directly toward 
them in this abstract setting, since it is more general and, we believe, 
conceptually simpler than the special cases of integration on the line 
or in R n . However, it does require that the reader temporarily accept 
the fact that interesting special cases are subsumed by the general 
theory. Specifically, it requires that he accept the assertion that there 
exists a countably additive measure function that extends the notion 
of the length of an interval. The proof of this assertion is in Chapter 9 
and can be read after completing Chapter 3 by those for whom the 
suspense is too great. 

In this introductory chapter we have attempted to provide motivation 
and to set the stage for the detailed discussion which follows. Some 
of our remarks here have been a bit vague and none of them has been 
proved. These defects will be remedied. However, since we shall 
have occasion to refer to the system of extended real numbers, we now 
append a brief description of this system. 
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In integration theory it is frequently convenient to adjoin the two 
symbols -oo, +oo to the real number system R. (It is stressed that 
these symbols are not real numbers.) We also introduce the convention 
that -oo < .v < +oo for any xe R. The collection R consisting of 
the set R u {-co, +oo} js called the extended real number system. 

One reason we wish to consider R is that it is convenient to say that 
the length of the real line is equal to +oo. Another reason is that we 
will frequently be taking the supremum (= least upper bound) of a 
set of real numbers. We know that a nonempty set + of real numbers 
which has an upper bound also has a supremum (in R). If we define 
the supremum of a nonempty set which does not have an upper bound 
to be +oo, then every nonempty subset of R (or R) has a unique 
supremum in R. Similarly, every nonempty subset of R (or R) has a 
unique infimum (= greatest lower bound) in R. (Some authors 
introduce the conventions that inf0 = +oo, sup 0 = — oo, but we 
shall not employ them.) 

If (x„) is a sequence of extended real numbers, we define the limit 
superior and the limit inferior of this sequence by 

lim sup x n = inf ( sup x „ ) , 

lim inf x n = sup I inf x„) . 

If the limit inferior and the limit superior are equal, then their value is 
called the limit of the sequence. It is clear that this agrees with the 
conventional definition when the sequence and the limit belong to R. 

Finally, we introduce the following algebraic operations between 
the symbols ±oo and elements x e R: 

(±oo) + (±co) = x + (±oo) = (±oo) + x = ±oo, 
(±°o)(±°o) = +oo, (±oo)(+oo) = —oo, 
x(±oo) = (±oo)x = ±oo if x > 0, 

= 0 if x = 0, 

= +oo if x < 0. 

It should be noticed that we do not define (+oo) + (-oo) or (-oo) + 
(+co), nor do we define quotients when the denominator is ±co. 
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Measurable Functions 


In developing the Lebesgue integral we shall be concerned with 
classes of real-valued functions defined on a set X . In various applica- 
tions the set X may be the unit interval 7 — [0, 1] consisting of all real 
numbers .v satisfying 0 < x 1 ; it may be the set TV = {] , 2, 3, . . .} 
of natural numbers; it may be the entire real line R; it may be all of 
the plane; or it may be some other set. Since the development of the 
integral does not depend on the character of the underlying space X, 
we shall make no assumptions about its specific nature. 

Given the set X, we single out a family X of subsets of X which are 
“well-behaved” in a certain technical sense. To be precise, we shall 
assume that this family contains the empty set 0 and the entire set X, 
and that X is closed under complementation and countable unions. 

2. 1 Definition. A family X of subsets of a set X is said to be a 
a-algebra (or a <j-field) in case : 

(i) 0, A' belong to X. 

(ii) If A belongs to X, then the complement If (A) = X \ A belongs 
to X. 

(iii) If (A„) is a sequence of sets in X, then the union (J“ =1 A n be- 
longs to X. 

An ordered pair (X, X) consisting of a set X and a a-algebra X of 
subsets of X is called a measurable space. Any set in X is called an 
6 
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A-measurable set, but when the a-algebra X is fixed (as is generally the 
case), the set will usually be said to be measurable. 

The reader will recall the rules of De Morgan : 

(2.1) <r(u A tt ) = n nA a ), *(n a.) = u 

It follows from these that the intersection of a sequence of sets in X 
also belongs to X. 

We shall now give some examples of a-algebras of subsets. 

2.2 Examples, (a) Let X be any set and let X be the family of all 
subsets of X. 

(b) Let X be the family consisting of precisely two subsets of X, 
namely 0 and X. 

(c) Let X = {1,2,3,...} be the set N of natural numbers and let 
X consist of the subsets 

0, {1,3,5,...}, {2,4,6,...}, X. 

(d) Let X be an uncountable set and A" be the collection of subsets 
which are either countable or have countable complements. 

(e) If Xi and X 2 are a-algebras of subsets of X, let X 3 be the inter- 
section of X x and X 2 ; that is, X 3 consists of all subsets of X which 
belong to both X x and X 2 . It is readily checked that X 3 is a a-algebra. 

(f) Let A be a nonempty collection of subsets of X . We observe 
that there is a smallest a-algebra of subsets of X containing A . To see 
this, observe that the family of all subsets of X is a a-algebra containing 
A and the intersection of all the a-algebras containing A is also a 
a-algebra containing A. This smallest a-algebra is sometimes called 
the a-algebra generated by A . 

(g) Let X be the set R of real numbers. The Borel algebra is the 
a-algebra B generated by all open intervals (a, b) in R. Observe that 
the Borel algebra B is also the a-algebra generated by all closed intervals 
[a, b] in R. Any set in B is called a Borel set. 

(h) Let X be the set R of extended real numbers. If £ is a Borel 
subset of R, let 

(2.2) E x = £u{ — co}, E 2 = EKJ{+ oo}, E 3 = E u {-oo, +oo}, 
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and let B be the collection of all sets E, E 1 , E 2 , E 3 as E varies over B. 
It is readily seen that B is a a-algebra and it will be called the extended 
Borel algebra. 

In the following, we shall consider a fixed measurable space {X, X). 

2.3 Definition. A function /on X to R is said to be A-measurable 
(or simply measurable) if for every real number a the set 

( 2 -3) {xeX: f(x) > «} 

belongs to X. 

The next lemma shows that we could have modified the form of the 
sets in defining measurability. 

2.4 Lemma. The following statements are equivalent for a function 
f on X to R : 

(a) For every ae R, the set A„ = {x e X : fix) > a} belongs to X. 

(b) For every a eR, the set B a = {x e X : fix) sc a} belongs to X. 

(c) For every ae R, the set C a = {xe X : f(x) «} belongs to X. 

(d) For every ae R, the set D a = {x e X : fix) < a} belongs to X. 

proof. Since B a and A a are complements of each other, statement 
(a) is equivalent to statement (b). Similarly, statements (c) and (d) 
are equivalent. If (a) holds, then A a _ lln belongs to X for each n and 
since 

oo 

Ca — O A a-nn, 
n = l 

it follows that C a e X. Hence (a) implies (c). Since 


A a = U C a+ 

n= 1 


1 In s 


it follows that (c) implies (a). q.e.d. 

2.5 Examples, (a) Any constant function is measurable. For, 
i ff(x) = c for all x e X and if « ^ c, then 


{x e X : f{x) > «} = 0, 

whereas if a < c, then 


{xe X : fix) > ct} = X. 
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(b) If Ee X, then the characteristic function xe, defined by 

Xe(x) = 1 , xeE, 

= 0 , x$E, 

is measurable. In fact, {x e X \ xe( x ) > a} is either X, E, or 0. 

(c) If X is the set R of real numbers, and X is the Borel algebra B, 
then any continuous function / on R to R is Borel measurable (that is, 
U-measurable). In fact, if / is continuous, then {xe R : fix) > a} is 
an open set in R and hence is the union of a sequence of open intervals. 
Therefore, it belongs to B. 

(d) If X = R and X — B, then any monotone function is Borel 
measurable. For, suppose that / is monotone increasing in the sense 
that x < x' implies fix) ^ fix'). Then {xe R : fix) > a} consists of 
a half-line which is either of the form {xe R \ x > a) or the form 
{x gR : x ^ a}, or is R or 0. 

Certain simple algebraic combinations of measurable functions are 
measurable, as we shall now show. 

2.6 Lemma. Let f and g be measurable real-valued functions and let 
c be a real number. Then the functions 

cf, f 2 , f + g, fg, |/|, 

are also measurable. 

proof, (a) If c = 0, the statement is trivial. If c > 0, then 
{xel: cf(x) > a} = {x e X : f(x) > a/c} e X. 

The case c < 0 is handled similarly. 

(b) If a < 0, then {x e X : (f(x)) 2 > a} = X\ if a > 0, then 

{xe X : {fix)) 2 > «} 

= {x e X : f{x) > Va} u {x e X : fix) < - Va}. 

(c) By hypothesis, if r is a rational number, then 

S r = {x e X : fix) > r} n {x e X : gix) > a - r} 
belongs to X. Since it is readily seen that 

{xe X : if + gX*) > a} = (J { S r : r rational} , 
it follows that f + g is measurable. 
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(d) Since fg = }[(/ + g) 2 — if - g) 2 ], it follows from parts (a), (b), 
and (c) that/g is measurable. 

(e) If a < 0, then {xe X : |/(x)| > a} = X, whereas if a > 0, then 
{xe X : \f(x)\ > a} = {x e X : f(x) > ci)u{ieI: /(x) < -a}. 

Thus the function |/| is measurable. q.e.d. 

If /is any function on X to R, let/ + and f~ be the nonnegative 
functions defined on X by 

(2.4) f + (x) = sup {f(x), 0}, f-{x) = sup { fix) , 0}. 

The function / + is called the positive part of / and f~ is called the 
negative part of /. It is clear that 

(2.5) / = / + -/" and |/| =/ + + f~ 
and it follows from these identities that 

(2.6) r = w \ +/),/- =ki/i -/). 

In view of the preceding lemma we infer that / is measurable if and 
only if f + and f~ are measurable. 

The preceding discussion pertained to real-valued functions defined 
on a measurable space. However, in dealing with sequences of 
measurable functions we often wish to form suprema, limits, etc., and 
it is technically convenient to allow the extended real numbers — oo , + oo 
to be taken as values. Hence we wish to define measurability for 
extended real-valued functions and we do this exactly as in Definition 2.3. 

2.7 Definition. An extended real-valued function on X is X- 
measurable in case the set {x e X : fix ) > a} belongs to X for each real 
number a. The collection of all extended real- valued AT-measurable 
functions on X is denoted by M{X, X) . 

Observe that if fe M(X, X) , then 

{xeX: fix) = +oo} = f) {x e X : /(x) > «}, 

n = 1 

{xe X : fix) = -co} = (xeX: fix) > -«}] , 

so that both of these sets belong to X. 
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The following lemma is often useful in treating extended real-valued 
functions. 

2.8 Lemma. An extended real-valued function f is measurable if and 
only if the sets 

A = {xe X : f(x) = +co}, B = {xe X : f(x) = -oo} 

belong to X and the real-valued function / defined by 

fi(x) = f(x), if x $ A u B, 

= 0, ifxe A u B, 

is measurable. 

proof. If/ is in M{X, X ), it has already been noted that A and B 
belong to X. Let ae R and a > 0, then 

{xe X : ffx) > a.} = {xe X : f(x) > a}\A. 

If a < 0, then 

{xe X : ffix) > a} = {xeX : f(x) > a} u B. 

Hence f is measurable. 

Conversely, if A, BeX and f is measurable, then 

{xe X : f{x) > «} = {x e X : ffix) > <x} u A 
when a > 0, and 

{xeX:f(x) > a} = {xeX-.ffx) > a}\B 

when a < 0. Therefore /is measurable. q.e.d. 

It is a consequence of Lemmas 2.6 and 2.8 that if/ is in M(X, X), 
then the functions 

cf, P, |/|, f\ f~ 

also belong to M(X, X) . 

The only comment that need be made is that we adopt the convention 
that 0( ±oo) = 0 so that cf vanishes identically when c = 0. If/ and g 
belong to M{X,X), then the sum / + g is not well-defined by the 
formula (/ + g)(x) = f(x) + g(x) on the sets 

E x = {xe X :f(x) = -oo and g(x) = +oo}, 

E 2 = {xe X : f(x) = +co and g(x) = — oo}, 
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both of which belong to X. However, if we define / + g to be zero 
on E 1 kj E 2 , the resulting function on X is measurable. We shall 
return to the measurability of the product fg after the next result. 

2.9 Lemma. Let iff) be a sequence in M(X,X) and define the 
functions 

f{x) = inf /„(*), F(x) = sup f n (x), 
f*(x ) = lim inf/„(x), F*(x) = lim sup/ n (*). 

Then /, F,f*, and F* belong to M(X, X). 

proof. Observe that 

{xe X : f(x) > «} = f) {xe X : f n (x) > a}, 

n = 1 

{X€ X : Fix) > a] = (j {xe X : f n ix) > a}, 

n = 1 

so that / and fare measurable when all the/„ are. Since 
f*(x) = sup { inf / m (x)l, 

F*(x) = inf { sup /„(*)}, 

the measurability of /* and F* is also established. q.e.d. 

2.10 Corollary. Ifif n ) is a sequence in Mi X, X) which converges 
to f on X, then f is in MiX, X). 

proof. In this case fix) — lim f n {x) = lim inf f n (x). q.e.d. 

We now return to the measurability of the product / g when /, g 
belong to M( X, X). IfneN, let /„ be the “truncation of '/” defined by 

fnix) = fix), if |/(x)| ^ n, 

= n, if fix) > n, 

— —n, if fix) < —n. 

Let g m be defined similarly. It is readily seen that f„ and g m are 
measurable (see Exercise 2.K). It follows from Lemma 2.6 that the 
product f n g m is measurable. Since 

fix) g m ix) = lim f r ix) g m ix), xeX, 
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it follows from Corollary 2.10 that f g m belongs to M(X, X). Since 
(fg)(x) = fix) g(x) = lim /(x) g m (x), xeX, 

m 

another application of Corollary 2. 10 shows that / g belongs to M(X, X) . 

It has been seen that the limit of a sequence of functions in M(X, X) 
belongs to M(X, X) . We shall now prove that a nonnegative function 
/ in M(X, X) is the limit of a monotone increasing sequence (<p n ) in 
M(X, X). Moreover, each <p n can be chosen to be nonnegative and to 
assume only a finite number of real values. 

2.11 Lemma. If f is a nonnegative function in M(X,X), then there 
exists a sequence (<p n ) in M(X, X) such that 

(a) 0 sg cp n (x) < cp n + 1 (x)for xe X,neN. 

(b) /(x) = lim <pfx) for each xe X. 

(c) Each <p n has only a finite number of real values. 

proof. Let n be a fixed natural number. If k = 0, 1 , . . . , n2 n — 1 , 
let E kn be the set 

E kn = {x e X : kl~ n < f(x) < (k + 1)2"*}, 

and if k = n2 n , let E kn be the set {x e X : f(x) > n). We observe that 
the sets {E kn : k — 0,1,..., n2 n } are disjoint, belong to X, and have 
union equal to X. If we define <p„ to be equal to k2~ n on E kn , then <p n 
belongs to M(X, X). It is readily established that the properties (a), 
(b), (c) hold. Q.E.D. 

COMPLEX-VALUED FUNCTIONS 

It is frequently important to consider complex-valued functions 
defined on X and to have a notion of measurability for such functions. 
We observe that if /is a complex-valued function defined on X, then 
there exist two uniquely determined real- valued functions fi,f 2 such 
that 

f — fi + 

(Indeed, ffx) = Re/(x), fifx) = Im f(x), for x e X.) We define the 
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complex-valued function /to be measurable if and only if its real and 
imaginary parts /i and / 2 , respectively, are measurable. It is easy to 
see that sums, products, and limits of complex-valued measurable 
functions are also measurable. 

FUNCTIONS BETWEEN MEASURABLE SPACES 

In the sequel we shall require the notion of measurability only for 
real- and complex-valued functions. In some work, however, one 
wishes to define measurability for a function / from one measurable 
space ( X , X) into another measurable space (7, Y ). In this case one 
says that / is measurable in case the set 

f~\E) = {xeX-.f{x)eE} 

belongs to X for every set E belonging to Y. Although this definition 
of measurability appears to differ from Definition 2.3, it is not difficult 
to show (see Exercise 2.P) that Definition 2.3 is equivalent to this 
definition in the case that Y — R and Y = B. 

This definition of measurability shows very clearly the close analogy 
between the measurable functions on a measurable space and continuous 
functions on a topological space. 

EXERCISES 

2. A. Show that [a, b] = H”=i (ft — l/n, b + l/n). Hence any o- 
algebra of subsets of R which contains all open intervals also contains 
all closed intervals. Similarly, (a, b) = (J” =1 [a + l/n, b - l/n], so 
that any <7-algebra containing all closed intervals also contains all 
open intervals. 

2.B. Show that the Borel algebra B is also generated by the collection 
of all half-open intervals (a, b] = {x e R : a < x ^ b}. Also show 
that B is generated by the collection of all half-rays {x e R : x > a}, 

ae R. 

2.C. Let (A n ) be a sequence of subsets of a set X. Let E 0 = 0 and 
for ne N, let 

En = U A k , F n = A n \E n - 1 . 

k= 1 
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Show that ( E n ) is a monotone increasing sequence of sets and that (F n ) 
is a disjoint sequence of sets (that is, F n n F m = 0 if n # m) such that 

00 co oo 

u E n = U F n = U A n . 

n — 1 n = 1 n= 1 


2.D. Let ( A n ) be a sequence of subsets of a set X. If A consists of 
all x 6 X which belong to infinitely many of the sets A n , show that 


00 r- CO T 

n u aX 

m=H n = m J 


The set A is often called the limit superior of the sets (A n ) and denoted 
by lim sup A„ . 


2.E. Let (A n ) be a sequence of subsets of a set X. If B consists of 
all x e X which belong to all but a finite number of the sets A n , show 
that 

CO r 00 -i 

b= u n aX 

m = 1 L n = m J 


The set B is often called the limit inferior of the sets ( A n ) and denoted 
by lim inf A n . 

2.F. If ( E n ) is a sequence of subsets of a set X which is monotone 
increasing (that is, Ei £ E 2 £ E 3 £ • • •), show that 

* oo 

lim sup E n — 1J E n — lim inf E n . 

n- 1 

2.G. If (F n ) is a sequence of subsets of a set X which is monotone 
decreasing (that is, Fj 2 F z 2 F 3 3 • • •), show that 

oo 

lim sup F n = n F n = lim inf F n . 

n = 1 

2.H. If ( A n ) is a sequence of subsets of X, show that 

0 £ lim inf A n £ lim sup A n £ X. 

Give an example of a sequence ( A n ) such that 


lim inf A n = 0, lim sup A n — X. 


Give an example of a sequence ( A n ) which is neither monotone 
increasing or decreasing, but is such that 


lim inf A n = lim sup A„. 
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When this equality holds, the common value is called the limit of (A n ) 
and is denoted by lim A n . 

2.1. Give an example of a function / on X to R which is not X- 
measurable, but is such that the functions |/| and f 2 are ^measurable. 

2.J. If a, b, c are real numbers, let mid (a, b, c ) denote the “value in 
the middle.” Show that 

mid (a, b, c ) = inf {sup {a, b}, sup {a, c}, sup {b, c}}. 

If f\ , fi , fz are A'-measurable functions on X to R and if g is defined 
for x e X by 

g(x) = mid (ffx), f 2 (x), f 3 (x )) , 
then g is ^-measurable. 

2.K. Show directly (without using the preceding exercise) that if / 
is measurable and A > 0, then the truncation f A defined by 

!a(x) = f(x), if |/(x)| < A , 

= A , if f(x) > A , 

= -A, ■ if f(x ) < -A, 

is measurable. 

2.L. Let / be a nonnegative .^-measurable function on X which is 
bounded (that is, there exists a constant K such that 0 sg f{x) < K for 
all xin X). Show that the sequence (<p n ) constructed in Lemma 2.11 
converges uniformly on X to /. 

2.M. Let / be a function defined on a set X with values in a set Y. 
If E is any subset of Y, let 

f-\E) = {xeX :/(jc)e£}. 

Show that / -1 (0) = 0, / _1 (^) = X. If £ and F are subsets of Y, 
then 

f-\E\F ) =f~\E)\f-\F). 

If {E a } is any nonempty collection of subsets of Y, then 

/ _i (u E a ) = u r\E a ), f-'(n E a ) = n/-m). 

In particular it follows that if Y is a a-algebra of subsets of Y, then 
(/ -1 (£) : Ee Y} is a a-algebra of subsets of X. 
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2.N. Let / be a function defined on a set X with values in a set Y . 
Let X be a a-algebra of subsets of X and let Y = {E s Y : f~ 1 (E) e X}. 
Show that Y is a a-algebra. 

2.0. Let (X, X) be a measurable space and /be defined on X to Y. 
Let A be a collection of subsets of Y such that f~ 1 (E) e X for every 
E e A. Show that /“ 1 (F) e X for any set F which belongs to the 
a-algebra generated by A . (Hint: Use the preceding exercise.) 

2.P. Let (X, X ) be a measurable space and /be a real-valued function 
defined on X. Show that /is A'-measurable if and only if f^ 1 (E) e X 
for every Borel set E. 

2.Q. Let (X, X) be a measurable space,/be an ^-measurable function 
on X to R and let <p be a continuous function on R to R. Show that 
the composition cp ° /, defined by (<p ° f)(x) = <p[f(x)\ , is A'-measurable. 
(Hint: If <p is continuous, then 9 _1 (£) e B for each E e B.) 

2.R. Let / be as in the preceding exercise and let ^ be a Borel 
measurable function. Show that </">/is ^-measurable. 

2.S. Let / be a complex-valued function defined on a measurable 
space (X, X). Show that /is A’-measurable if and only if 

{xe X : a < Re f(x) < b, c < Im f(x ) < d } 

belongs to X for all real numbers a,b,c,d. More generally, / is 
AT-measurable if and only if f~ 1 (G) 6 X for every open set G in the 
complex plane C. 

2.T. Show that sums, products, and limits of complex- valued 
measurable functions are measurable. 

2.U. Show that a function / on X to R (or to R) is A'-measurable if 
and only if the set A a in Lemma 2.4(a) belongs to X for each rational 
number a; or, if and only if the set B x in Lemma 2.4(b) belongs to X 
for each rational number a; etc. 

2.V. A nonempty collection M of subsets of a set X is called a 
monotone class if, for each monotone increasing sequence ( E n ) in M 
and each monotone decreasing sequence ( F n ) in M, the sets 

oo oo 

U En, n Fn 

n=l n=l 
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belong to M. Show that a o-algebra is a monotone class. Also, if 
A is a nonempty collection of subsets of X, then there is a smallest 
monotone class containing A . (This smallest monotone class is called 
the monotone class generated by A .) 

2.W. If A is a nonempty collection of subsets of X, then the a-algebra 
S generated by A contains the monotone class M generated by A. 
Show that the inclusion A c M c S may be proper. 



CHAPTER 3 


Measures 


We have introduced the notion of a measurable space (X, X ) con- 
sisting of a set X and a a-algebra X of subsets of X . We now consider 
certain functions which are defined on X and have real, or extended 
real values. These functions, which will be called “measures,” are 
suggested by our idea of length, area, mass, and so forth. Thus it is 
natural that they should attach the value 0 to the empty set 0 and that 
they should be additive over disjoint sets in X. (Actually we shall 
require that they be countably additive in the sense to be described 
below.) It is also desirable to permit the measures to take on the 
extended real number +oo. 

3.1 Definition. A measure is an extended real- valued function ^ 
defined on a cr-algebra X of subsets of X such that (i) /x(0) = 0, (ii) 
n(E) ^ 0 for all E e X, and (iii) /x is countably additive in the sense 
that if (E n ) is any disjoint sequence* of sets in X, then 

(3- 1 ) /*( 0 E n ) = J n{E n ). 

Since we permit /x to take on +oo, we remark that the appearance of 
the value +co on the right side of the equation (3.1) means either that 
i'AE n ) = +co for some n or that the series of nonnegative terms on the 
right side of (3.1) is divergent. If a measure does not take on +oo, 

* This means that E n n E m = 0 if « ^ m . 

19 
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we say that it is finite. More generally, if there exists a sequence (£„) 
of sets in X with X = \ J E n and such that p(E n ) < +oo for all n, then 
we say that /j, is a-finite. 

3.2 Examples, (a) Let X be any nonempty set and let X be the 
CT-algebra of all subsets of X. Let p r be defined on X by 

Pi(E) = 0, for all EeX\ 

and let p 2 be defined by 

/x 2 (0) = 0, p 2 (E) = if E # 0. 

Both fj-! and p 2 are measures, although neither one is very interesting. 
Note that p 2 is neither finite nor a-finite. 

(b) Let {X, X) be as in (a) and let p be a fixed element of X. Let p 
be defined for E e X by 

p(E) = 0, if p $ E, 

= 1 , if p 6 E. 

It is readily seen that p is a finite measure; it is called the unit measure 
concentrated at p . 

(c) Let X = N = {1 , 2, 3, . . .} and let X be the a-algebra of all 
subsets of N. If E e X, define p(E) to be equal to the number of 
elements in £ if £ is a finite set and equal to +oo if £ is an infinite set. 
Then p is a measure and is called the counting measure on N. Note 
that p is not finite, but it is a-finite. 

(d) If X = R and X = B, the Borel algebra, then it will be shown 
in Chapter 9 that there exists a unique measure A defined on B which 
coincides with length on open intervals. [By this we mean that if £ 
is the nonempty interval (a, b), then A(£) = b — a.\ This unique 
measure is usually called Lebesgue (or Borel) measure. It is not a 
finite measure, but it is a-finite. 

(e) If X = R, X = B, and / is a continuous monotone increasing 
function, then it will be shown in Chapter 9 that there exists a unique 
measure A ; defined on B such that if £ = (a, b), then A f (E) = f(b) — 
f(a). This measure A ; is called the Borel-Stieltjes measure generated 

by/- 

We shall now derive a few simple results that will be needed later. 
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3.3 Lemma. Let p be a measure defined on a o-algebra X . If E and 
F belong to X and E £ F, then /j.(E ) < p(F). If ifE) < +co, then 
m(F\ E) = n(F) - KE). 

proof. Since F = £u (F\E) and E n (F\ E) = 0, it follows that 

MF) = tfF) + KF\ E). 

Since /*(F\F) ^ 0, it follows that /n(F ) ^ /n(F). If m(F) < +oo, 
then we can subtract it from both sides of this equation. q.e.d. 

3.4 Lemma. Let n be a measure defined on a o-algebra X. 

(a) 7/(F„) is an increasing sequence in X, then 

(3.2) /*( U i E n ) = lim /*(£„). 

(b) If (F n ) is a decreasing sequence in X and if ^(Ffi < +oo, then 

(3.3) J n F„) = lim^Fn). 

proof, (a) If t*(E n ) = + co for some n, then both sides of equation 
(3.2) are +oo. Hence we can suppose that /*(£„) < +oo for all n. 

Let A l = £) and A n = E n \ E n . 1 for n > 1 . Then ( A n ) is a disjoint 
sequence of sets in X such that 

E n = U A t , 0 E n = 0 A n . 

j = 1 n = 1 n = 1 

Since /jl is countably additive, 

/ 'X) \ in 

U ^ = 2 F(A n ) = lim 2 KA n )- 

Vn=1 ' n =1 #=1 

By Lemma 3.3 n(A n ) = n(E n ) — ^(F n _j) for n > 1 , so the finite series 
on the right side telescopes and 

2 M^n) = MF m ) . 


Hence equation (3.2) is proved. 
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(b) Let E n = F x \F n , so that (E n ) is an increasing sequence of sets 
in X. If we apply part (a) and Lemma 3.3, we infer that 

/*( 0 i E n) = lim j±{E n ) = lim - /x(F n )] 

= K E i) ~ lim KF n ). 

Since (J" = 1 E n = F x \P|” =1 it follows that 

/*(u i £.) = n/n). 

Combining these two equations, we obtain (3.3). q.e.d. 

3.5 Definition. A measure space is a triple ( X , X, jx) consisting 
of a set X, a a-algebra X of subsets of X, and a measure /x defined on X. 

There is a terminological matter that needs to be mentioned and 
which shall be employed in the following. We shall say that a certain 
proposition holds /^-almost everywhere if there exists a subset TV e X 
with n(N) = 0 such that the proposition holds on the complement 
of TV. Thus we say that two functions /, g are equal /x-almost every- 
where or that they are equal for ^-almost all x in case f(x) = g(x) when 
x $ TV, for some TV e X with fx(N) = 0 . In this case we will often 
write 

f=g, fi-a.e. 

In like manner, we say that a sequence (/„) of functions on X converges 
/^-almost everywhere (or converges for /^.-almost all x) if there exists a 
set TV e X with /x(TV) = 0 such that f(x) = lim f n (x) for x TV. In this 
case we often write 

/= lim/ n , /x-a.e. 

Of course, if the measure /a is understood, we shall say “almost every- 
where” instead of ‘V-almost everywhere.” 

There are some instances (suggested by the notion of electrical 
charge, for example) in which it is desirable to discuss functions which 
behave like measures except that they take both positive and negative 
values. In this case, it is not so convenient to permit the extended real 
numbers +oo, — oo to be values since we wish to avoid expressions of 
the form (+oo) + (-oo). Although it is possible to handle “signed 
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measures” which take on only one of the values +oo, — oo, we shall 
restrict our attention to the case where neither of these symbols is 
permitted. To indicate this restriction, we shall introduce the term 
“charge,” which is not entirely standard. 

3.6 Definition. If X is a a-algebra of subsets of a set X, then a 
real-valued function A defined on X is said to be a charge in case 
A(0) = 0 and A is countably additive in the sense that if (E n ) is a disjoint 
sequence of sets in X, then 

A( 0 En) = t A (/•„). 

[Since the left-hand side is independent of the order and this equality 
is required for all such sequences, the series on the right-hand side 
must be unconditionally convergent for all disjoint sequences of 
measurable sets.] 

It is clear that the sum and difference of two charges is a charge. 
More generally, any finite linear combination of charges is a charge. 
It will be seen in Chapter 5 that functions which are integrable over a 
measure space ( X , A", /a) give rise to charges. Later, in Chapter 8, 
we will characterize those charges which are generated by integrable 
functions. 

EXERCISES 

3.A. If /a is a measure on X and A is a fixed set in X, then the 
function A , defined for E e X by A (E) = /a (A n E), is a measure on X. 

3.B. If Mi , • ■ • > Fn are measures on X and a lf ... ,a n are nonnegative 
real numbers, then the function A, defined for E e X by 

KE) = 2 a if Xj{E), 

i= 1 

is a measure on X. 

3.C. If (ju-n) is a sequence of measures on X with n n (X) = 1 and if A 
is defined by 

oo 

A (E) = 2 2~Vn(£), EeX_ 

n= 1 
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then A is a measure on X and A(A') = 1 . 

3.D. Let X — N and let X be the cr-algebra of all subsets of N. If 
( a n ) is a sequence of nonnegative real numbers and if we define fx by 

+( 0 ) = 0; fx(E) = 2 a n, E # 0; 

ns E 

then ytx is a measure on X. Conversely, every measure on X is obtained 
in this way for some sequence (a n ) in R . 

3.E. Let X be an uncountable set and let X be the family of all 
subsets of X. Define fx on E in X by requiring that /x(E) — 0, if E is 
countable, and /u(£) = +co, if E is uncountable. Show that ^ is a 
measure on X. 

3.F. Let X = TV and let X be the family of all subsets of TV. If E is 
finite, let fx(E) = 0; if E is infinite, let fx(E) = +co. Is p a measure 
on XI 

3.G. If X and X are as in Exercise 3.F, let A(£) = +oo for all E e X. 
Is A a measure? 

3.H. Show that Lemma 3.4(b) may fail if the finiteness condition 
^(/q) < +oo is dropped. 

3.1. Let (X, X, /r) be a measure space and let (£„) be a sequence in 
A". Show that 

/n(lim inf £„) < lim inf ^(£„). 

[See Exercise 2.E.] 

3.J. Using the notation of Exercise 2.D, show that 
lim sup m(£„) /i(lim sup E n ) 

when^(U E n ) < +go . Show that this inequality may fail if ^((J £ n ) = 
+oo. 

3.K. Let (X, X, fx) be a measure space and let Z = {£ e X : ^.(£) = 
0}. Is Za CT-algebra? Show that if Ee Zand Ee X, then £ n Fe Z. 
Also, if E n belongs to Z for n e N, then \J E n e Z. 

3.L. Let X, X, fi, Z be as in Exercise 3.K and let X' be the family of 
all subsets of X of the form 


(£UZ 1 )\Z 2 , Ee X, 
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where Z x and Z 2 are arbitrary subsets of sets belonging to Z. Show 
that a set is in X' if and only if it has the form £u2 where EeX and Z 
is a subset of a set in Z. Show that the collection X' forms a a-algebra 
of sets in X. The cr-algebra X' is called the completion of X (with 
respect to /x) . 

3.M. With the notation of Exercise 3.L, let /x' be defined on X' by 
,,'(EXJZ) = n{E), 

when EeX and Z is a subset of a set in Z. Show that p.' is well- 
defined and is a measure on A" which agrees with /x on X. The 
measure yi is called the completion of /x. 

3.N. Let (X,X,/jl) be a measure space and let (X, X' , y.') be its 
completion in the sense of Exercise 3.M. Suppose that / is an X'- 
measurable function on X to R . Show that there exists an AT-measurable 
function g on X to R which is /x-almost everywhere equal to f. {Hint: 
For each rational number r, let A r = {x : f{x) > r} and write A r = 
E r uZ r , where E r e X and Z r is a subset of a set in Z. Let Z be a set 
in Z containing IJ Z r and define g(x) = f{x) for x$Z, and g(x) = 0 
for x eZ. To show that g is ^-measurable, use Exercise 2.U.) 

3.0. Show that Lemma 3.4 holds if /x is a charge on X. 

3.P. If /x is a charge on X, let -n be defined for E e X by 

■n(E) = sup {/lx(T) : A ^ E, A eX}. 

Show that 7T is a measure on X. {Hint: If ■> r{E^ < oo and % > 0, let 
F n eX be such that F n S ^ a nd n{E^ sc ,4F n ) + 2~ n e.) 

3.Q. If /x is a charge on X, let v be defined for E e X by 

v{E) = sup 2 HAM , 

; = i 

where the supremum is taken over all finite disjoint collections {A f } 
in X with E = lj?=i A s . Show that v is a measure on X. (It is called 
the variation of g.) 

3.R. Let A denote Lebesgue measure defined on the Borel algebra B 
of R [see Example 3.2(d)], (a) If E consists of a single point, then 
Ee B and A(£) =0. (b) If E is countable, then Ee B and \{E) = 0. 
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(c) The open interval (a, b), the half-open intervals (a, b], [a, b), and 
the closed interval [a, b ] all have Lebesgue measure b - a. 

3.S. If A denotes Lebesgue measure and E is an open subset of R, 
then A (E) > 0 if and only if E is nonvoid. Show that if if is a 
compact subset of R, then X(K) < + 00 . 

3.T. Show that the Lebesgue measure of the Cantor set (see Reference 
[1], p. 52) is zero. < 

3.U. By varying the construction of the Cantor set, obtain a set of 
positive Lebesgue measure which contains no nonvoid open interval. 

3.V. Suppose that £ is a subset of a set N e X with fi(N ) = 0 but 
that E$X. The sequence (/„),/„ = 0, converges /x-almost everywhere 
to xe ■ Hence the almost everywhere limit of a sequence of measurable 
functions may not be measurable. 



CHAPTER 4 


The Integral 


In this chapter we shall introduce the integral first for nonnegative 
.simple measurable functions and then for arbitrary nonnegative 
extended, real-valued measurable functions. The principal result is 
the celebrated Monotone Convergence Theorem, which is a basic 
tool for everything that follows. 

Throughout this chapter we shall consider a fixed measure space 
(X,X,fi). We shall denote the collection of all Z-measurable 
functions on X to R by M = M(X, X) and the collection of all non- 
negative ^-measurable functions on X to R by M + = M + {X,X). 
We shall define the integral of any function in M + with respect to the 
measure p. In order to do so we shall find it convenient to introduce 
the notion of a simple function. It is convenient to require that simple 
functions have values in R rather than in R. 

4.1 Definition. A real- valued function is simple if it has only a 
finite number of values. 

A simple measurable function <p can be represented in the form 
( 4<1 ) 9 = 2 

; = i 

where a, e R and xe, is the characteristic function of a set E,- in X. 
Among these representations for <p there is a unique standard repre- 
sentation characterized by the fact that the a, are distinct and the E, 


27 



28 The Elements of Integration 


are disjoint nonempty subsets of X and are such that X = (J" =1 Ej- 
(Of course, if we do not require the a,- to be distinct, or the sets E, to 
be disjoint, then a simple function has many representations as a linear 
combination of characteristic functions.) 

4.2 Definition. If 93 is a simple function in M + (X, X) with the 
standard representation (4.1), we define the integral of <p with respect to 
P to be the extended real number 

(4.2) dp = ^ a f p(Ej) . 

J j-1 

In the expression (4.2) we employ the convention that 0(+oo) = 0 
so the integral of the function identically 0 is equal to 0 whether the 
space has finite or infinite measure. It should be noted that the value 
of the integral of a simple function in M + is well-defined (although it 
may be +00) since all the a, are nonnegative, and so we do not encounter 
meaningless expressions such as (+00) - (+00). 

We shall need the following elementary properties of the integral. 

4.3 Lemma, (a) If <p and f are simple functions in M + (X, X) and 
c tz 0, then 

j Cep dp = c j ep dp, 

j(<P +>{>) dp = jepdp + dp. 

(b) If A is defined for E in X by 

= jvXEdp, 

then A is a measure on X. 

proof. If c = 0, then c<p vanishes identically and the equality 
holds. If c > 0, then cep is in M + with standard representation 

n 

crp = 2 Ca iXE,, 

J = 1 
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when rp has standard representation ( 4 . 1 ). Therefore 

I cep dfi = ^ ca i = c 2 ^ = c I 9 dp-. 

J 3 = 1 J=1 J 

Let 9? and $ have standard representations 


9=2 a t * e < > 'A = 2 

;=1 fc = l 

then 93 + ip has a representation 

n m 

9 + 4> = 2 2 + h ^XE,r,F k ■ 

j=l k= 1 

However, this representation of 9 + as a linear combination of 
characteristic functions of the disjoint sets E f O is not necessarily 
the standard representation for 9 + >/>, since the values a s + b k may 
not be distinct. Let c h , h = 1 , . . . , p , be the distinct numbers in the 
set {a, + b k : j — 1 , . . . , n ; k = \ , ... ,m} and let G h be the union 
of all those sets Ej nf^O such that aj + b k = Ch- Thus 


d c 'h) = 2 ^ E i n F k)> 




where the notation designates summation over all j, k such that 
+ b k = c h . Since the standard representation of 9 + is given by 

■A 

9 + 9 = 2 , ka h , 

h = 1 

we find that 

(95 + ifi) dn = 2 MGft) = 2 2 Cft ti E j n F k) 

h = 1 fc = l (ft) 

= 2 2 / x (^ n ^ 
ft = 1 (ft) 

n m 

= 22 {a > + *fc) ^ e j n ^ 

y-i fc = 1 

n m n m 

2 2 a i ^ E i ° + 2 2 b * f(e i n - F ^- 


I 


j=i fc= 1 


y=i fc=i 
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Since X is the union of both of the disjoint families {£,} and {F k ), then 
/x(£,) = 2 /*(£, n F k ), n(F k ) = J l4E, n F k ). 

k=l j = i 

We employ this observation (and change the order of summation in 
the second term) to obtain the desired relation 

/• n m 

\ (<p + ifi) dn = 2 a i m(£)) +2 bk E{F k ) 

J y=i fc = i 

= J rf/tt + J ifi d/jL . 

To establish part (b), we observe that 


<P Xe — 2 a iXEj^E- 


Hence, it follows by induction from what we have proved that 
A(£) = | n£ ^ = 2 a i f As, ns dfi. — 2 a , l l (E, O £■). 

J )-l J j—1 

Since the mapping £ /*(£) n £) is a measure (see Exercise 3. A) we 

have expressed A as a linear combination of measures on X. It follows 
(see Exercise 3.B) that A is also a measure on X. q.e.d. 

We are now prepared to introduce the integral of an arbitrary 
function in M + . Observe that we do not require the value of the 
integral to be finite. 

4.4 Definition. If / belongs to M + ( X, X), we define the integral 
of / with respect to /x to be the extended real number 


(4.3) 


J fdfj. = sup jyd/x, 


where the supremum is extended over all simple functions <p in 
M + (X, X) satisfying 0 < <p(x) < f(x) for all xe X. If / belongs to 
M + {X, X) and E belongs to X, then /y E belongs to M + (X, X) and we 
define the integral of / over E with respect to 94 to be the extended real 
number 


(4.4) 


J fi fdp = J f Xe dp. 
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We shall first show that the integral is monotone both with respect 
to the integrand and the set over which the integral is extended. 

4.5 Lemma, (a) Iff and g belong to M + (X, X) and f =% g, then 

(4.5) jfdp^jgdii. 

(b) If f belongs to M + (X, X), if E, F belong to X, and if E £ F, then 

fdfj. < fdp. 

proof, (a) If <p is a simple function in M + such that 0 ^ <p < /, 
then 0 < <p g. Therefore (4.5) holds. 

(b) Since f\ E < f\ F , part (b) follows from (a). q.e.d. 

We are now prepared to establish an important result due to B. Levi. 
This theorem provides the key to the fundamental convergence 
properties of the Lebesgue integral. 

4.6 Monotone Convergence Theorem. If (/„) is a monotone 
increasing sequence of functions in M~(X, X) which converges to f, then 

(4.6) J fdn = lim J /„ dp. 

proof. According to Corollary 2.10, the function /is measurable. 
Since /„ < /„+i < /, it follows from Lemma 4.5(a) that 

j fn d/A ^ j fn + ldfA ^ / dfl 

for all n e N. Therefore we have 

lim J /„ dfA < J fdfA. 

To establish the opposite inequality, let a be a real number satisfying 
0 < a < 1 and let y be a simple measurable function satisfying 
0 < y < /. Let 

A n = {x e X : ffx) 2s a<p(x)} 
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sothat A n eX,A n c A n + 1 , and X = (J A n . According to Lemma 4.5, 
(4.7) f a<p dp sc f f n dp ^ f fn dp, 

•M n J A n J 

Since the sequence ( A n ) is monotone increasing and has union X, it 
follows from Lemmas 4.3(b) and 3.4(a) that 



Therefore, on taking the limit in (4.7) with respect to n, we obtain 
a j<p dfjt, < lim j f n dix. 

Since this holds for all a with 0 < a < 1 , we infer that 



and since <p is an arbitrary simple function in M + satisfying 0 < <p ^ /, 
we conclude that 



If we combine this with the opposite inequality, we obtain (4.6). q.e.d. 

Remark. It should be observed that it is not being assumed that 
either side of (4.6) is finite. Indeed, the sequence (J/ n dp) is a mono- 
tone increasing sequence of extended real numbers and so always has 
a limit in R, but perhaps not in R. 

We shall now derive some consequences of the Monotone Con- 
vergence Theorem. 


4.7 Corollary. 
to M + and 


(a) Iff belongs to M + and c ^ 0, then cf belongs 


j cf dp = c J 


fdp. 


(b) Iff, g belong to M + , then f + g belongs to M + and 

J(/ + g) dp = J fdp + jgdp. 
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proof, (a) If c = 0 the result is immediate. If c > 0, let (<p n ) be 
a monotone increasing sequence of simple functions in M + converging 
to / on X (see Lemma 2.11). Then (c<p n ) is a monotone sequence 
converging to cf. If we apply Lemma 4.3(a) and the Monotone 
Convergence Theorem, we obtain 



= c 



(b) If (<p„) and (ifi n ) are monotone increasing sequences of simple 
functions converging to / and g, respectively, then (<p n + <fi n ) is a 
monotone increasing sequence converging to f + g. It follows from 
Lemma 4.3(a) and the Monotone Convergence Theorem that 


j(f+ g) = lim J(<p n + >pn) d/j. 

= lim jcp n d[x + lim J </i n d/j. 

= jfdp + jgdp. 


Q.E.D. 


The next result, a consequence of the Monotone Convergence 
Theorem, is very important for it enables us to handle sequences of 
functions that are not monotone. 

4.8 Fatou’s Lemma. belongs to M + (X, X ), then 

(4.8) J (lim inf f n ) df± < lim inf J f n dp.. 


proof. Let g m = inf{/ m ,/ m + i , . . .} so that g m ^ /„ whenever 
m n. Therefore 

jgmdfj. ^ j f n dfj,, m^n, 

so that 

jgm d[>. < lim inf J /„ dp. 

Since th6 sequence (g m ) is increasing and converges to lim inf /„, the 
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Monotone Convergence Theorem implies that 

dft = lim jg m dp 

5% lim inf J f n d \p. q.e.d. 

It will be seen in an exercise that Fatou’s Lemma may fail if it is not 
assumed that /„ > 0. 

4.9 Corollary. If f belongs to M + and if A is defined on X by 

(4-9) m = f / djx, 

Je 

then X is a measure. 



proof. Since / 5= 0 it follows that X(E) ^ 0. If E = 0, then f Xs 
vanishes everywhere so that A(0) = 0. To see that A is countably 
additive, let (£ n ) be a disjoint sequence of sets in X with union E and 
let f n be defined to be 

n 

fn — 2 f XE k ■ 
k = l 

It follows from Corollary 4.7(b) and induction that 

( fn dfj- = 2 f f Xe k dfi = ^ HEf). 

•1 k = lj k = 1 

Since (/J is an increasing sequence in M + converging to f Xs , the 
Monotone Convergence Theorem implies that 


ME) 


J 


f Xe dfi = lim 


t* oo 

2 

J k=l 


ME k ). 


Q.E.D. 


4.10 Corollary. Suppose that f belongs to M + . Then f(x) = 0 
p-almost everywhere on X if and only if 


(4.10) 


J f dp = 0. 


proof. If equation (4.10) holds, let 

E n = \xeX-. f(x) > ij-. 
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so that / ^ (1 /tj) xe„ » from which 

0 = Jfdp>± p(E n ) > 0. 

It follows that p(E n ) = 0; hence the set 

{xe X : f(x) > 0} = Q E n 

n- 1 

also has measure 0. 

Conversely, let f{x) = 0 /x-almost everywhere. If 
E = {xeX:f(x) > 0}, 

then p(E) = 0. Let f n = hxe- Since / ^ lim inf /„, it follows from 
Fatou’s Lemma that 


0 < j f dp < lim inf J /„ dn = 0. 


4.11 Corollary. Suppose that f belongs to M + , and define X on X 
by equation (4.9). Then the measure A is absolutely continuous with 
respect to p in the sense that if EeX and p(E) = 0, then X(E) = 0. 

proof. If p(E) = 0 for some EeX, then fx K vanishes /^.-almost 
everywhere. By Corollary 4.10, we have 


K E ) = J fxB dp = 0. 


Q.E.D. 


We shall now show that the Monotone Convergence Theorem holds 
if convergence on X is replaced by almost everywhere convergence. 

4.12 Corollary. If (/„) is a monotone increasing sequence of 
functions in M + (X, X) which converges p-almost everywhere on X to 
a function f in M + , then 

J fdp = lim J f n dp. 

proof. Let N € X be such that p(N) = 0 and (/„) converges to f 
at every point of M = X\ N. Then (f n Xm ) converges to f Xu on X, 
so the Monotone Convergence Theorem implies that 


J fXM dp = lim J f n Xm dp. 
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Since p{N) = 0, the functions f Xn and /„ Xn vanish ^-almost every- 
where. It follows from Corollary 4.10 that 

J" f Xn dp = 0, j fn Xn dp = 0. ,• 

Since / = f Xm +/xn and f n = f n Xm + /» Xn, it follows that 

J 7 *-J fxu dp = lim J f n xm dp = lim J f n dp. q.e.d. 

4.13 Corollary. Let ( g n ) be a sequence in M + , then 

/(£*)*-.? i(l*4 

proof. Let f n = gi H 1- g n , and apply the Monotone Con- 
vergence Theorem. q.e.d. 

EXERCISES 

4. A. If the simple function <p in M + (X, X) has the (not necessarily 
standard) representation 

771 

= ; .ton 

k = 1 

where b k e R and F k e X, show that 

/* m 

\<pdp = 2 b k p(F k ). 

J k=l 

4.B. The sum, scalar multiple, and product of simple functions are 
simple functions. [In other words, the simple functions in M(X, X) 
form a vector subspace of M(X,X), closed under products.] 

4.C. If <px and <p 2 are simple functions in M(X, X), then 
Ip = sup {<P! ,(p 2 ), CD = inf { 9 0 1 , cp 2 } 
are also simple functions in M(X, X) . 

4.D. If feM + and c > 0, then the mapping cp — > ip — cep is a 
one-to-one mapping between simple functions cp 6 M + with cp ^ / 
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and simple functions tfi in M + with >jj < cf. Use this observation to 
give a different proof of Corollary 4.7(a). 

4.E. Let f,g e M + , let w € M + be a simple function such that 
to < / + 9 and let (p n (x ) = sup{(m/n)ui(x) : for 0 ^ m ^ n with 
(m/n)w(x) ^ /(x)}. Also let rp n (x) = sup{(l — l/n)w(x) — <^ n (x),0}. 
Show that (1 - 1 /n)u> <p n +ip n and tp„ ^ /, V’n ^ 9- 

4.F. Employ Exercise 4.E to establish Corollary 4.7(b) without 
using the Monotone Convergence Theorem. 

4.G. Let X = N, let X be all subsets of N, and let 9 be the counting 
measure on X. If/is a nonnegative function on N, then / e M + (X, X) 
and 

f /<**« 2 An). 

J n = 1 

4.H. Let X = R, X = B, and let A be the Lebesgue measure on B. 
If f n - Xio.m, then the sequence is monotone increasing to / = xio. + «)• 
Although the functions are uniformly bounded by 1 and the integrals 
of the /„ are all finite, we have 

J fdX — + 00 . 

Does the Monotone Convergence Theorem apply ? 

4.1. Let X = R, X = B, and A be Lebesgue measure on X. If 
fn - (1 /«) X[n, +«>, then the sequence (/„) is monotone decreasing and 
converges uniformly to / = 0, but 

0 = J f d\ 7^ lim j f„ dX — +co . 

(Hence there is no theorem corresponding to the Monotone Con- 
vergence Theorem for a decreasing sequence in M + .) 

4.J.(a) Let /„ = ()/n) x [0 , n] , / = 0. Show that the sequence (/ n ) 
converges uniformly to /, but that 

J fdX ^ lim j f n d\. 

Why does this not contradict the Monotone Convergence Theorem? 
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Does Fatou’s Lemma apply ? 

(b) Let g n = n xwn.ztm, g = 0. Show that 

jgdX f lim jg n d\. 

Does the sequence ( g n ) converge uniformly to gl Does the Monotone 
Convergence Theorem apply? Does Fatou’s Lemma apply? 

4.K. If (X, X, fj . ) is a finite measure space, and if (/„) is a real-valued 
sequence in M + (X, X) which converges uniformly to a function /, 
then /belongs to M + (X, X ), and 



4.L. Let AT be a finite closed interval [a, b] in/?, let A" be the collection 
of Borel sets in X, and let A be Lebesgue measure on X. If / is a 
nonnegative continuous function on X, show that 

J f dX = J ^ dx ’ 

where the right side denotes the Riemann integral of /. {Hint: First 
establish this equality for a nonnegative step function, that is, a linear 
combination of characteristic functions of intervals.) 

4.M. Let X = [0, +oc), let X be the Borel subsets of X, and let A 
be Lebesgue measure on X. If /is a nonnegative continuous function 
on X, show that 

[ f d\ = lim f f{x) dx. 

J &-► + oo Jo 

Hence, iff is a nonnegative continuous function, the Lebesgue and the 
improper Riemann integrals coincide. 

[The next three exercises deal with the integration of functions which 
do not belong to M + . They can be omitted until the next chapter has 
been read. However, we include them here because they illustrate the 
restrictions required by Fatou’s Lemma.] 
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4.N. If f n = (- 1/«) X[o.ni> then the sequence (/„) converges uniformly 
to / = 0 on [0,co). However, f f n dX = - 1 whereas \ f d\ = 0, so 


lim inf J f n d\ = — 1 < 0 = J/dA. 


Hence Fatou’s Lemma 4.8 may not hold unless ^ 0, even in the 
presence of uniform convergence. 

4.0. Fatou’s Lemma has an extension to a case where the f n take 
on negative values. Let h be in M + (X,X), and suppose that 
j h dp < +oo . If (/ n ) is a sequence in M(X , X ) and if -h < /„, then 

J (lim inf /„) dp < lim inf J f n dp . 

4.P. Why doesn’t Exercise 4.0 apply to Exercise 4.N? 

4.Q. If /e M + (X, X) and 

J fdp < +co, 

then p{xeX: f(x) = +oo} = 0 . [Hint: If E n = {x e X : f(x) > n) , 

then n XEn < /.] 

4.R. lifeM + (X, X)and 

jfdp < +oo, 


then the set N = {x e X : f(x) > 0} is cr-finite (that is, there exists a 
sequence ( F n ) in X such that N s (J F n and p(F n ) < +oo). 

4.S. If feM + (X, X) and 

j fdp < +oo. 


then for any s > 0 there exists a set Ee X such that p(E) < +oo and 

J/4 1 < J fdp + t. 

4.T. Suppose that (/„) C M + ( X, X), that (/„) converges to/, and 
that 

f fdp = lim [ f n dp < +oo. 
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Prove that 


for each EeX. 


f f d E = Una f f n dfx 

Je Jb 


4.U. Show that the conclusion of Exercise 4.T may fail if the 
condition 


lim J /„ dfx 


< +oo 


is dropped. 



CHAPTER 5 


Integrable Functions 


In Definition 4.4 we defined the integral of each function in 
M + — M + (X,X) with respect to a measure p and permitted this 
integral to be +co. In this chapter we shall discuss the integration 
of measurable functions which may take on both positive and negative 
real values. Here it is more convenient to require the values of the 
functions and the integral to be finite real numbers. 

5.1 Definition. The collection L = L(X, X, p) of integrable (or 
summable) functions consists of all real-valued AT-measurable functions 
/ defined on X, such that both the positive and negative parts / + ,/~, 
of/ have finite integrals with respect to p. In this case, we define the 

integral of / with respect to p to be 

( 5 - 1 ) jfdp = jf + dp- jf-dp. 

If E belongs to X, we define 

( f dp = \ f + dp - f f- dp. 

Je Je Je 

Although the integral of / is defined to be the difference of the 
integrals of it is easy to see that if / = /i ~/ 2 where f x , f 2 are 
any nonnegative measurable functions with finite integrals, then 

jfdp = J/i dp - j f 2 dp. 


41 
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In fact, since / + -/“=/ = /i - / 2 , it follows that/ + +f 2 =fi + 
f~. If we apply Corollary 4.7(b), we infer that 

J/ + dii + J/ a 4* = J/i dp + J /' dp. 

Since all these terms are finite, we obtain 

J7<^ = J7 + 4*- J/"4t- ffidp- jfidp. 

5.2 Lemma. If /belongs to L and A w defined on X to R by 
(5-2) A (£) = f fdp, 

JE 

then A is a charge. 

proof. Since f + and/ - belong to M + , Corollary 4.9 implies that 
the functions A + and A", defined by 

A + (E) = f / + 4*, A-(£) = f /- 4*, 

are measures on AT; they are finite because fsL. Since A = A + - A - , 
it follows that A is a charge. q.e.d. 

The function A defined in (5.2) is frequently called the indefinite 
integral of/ (with respect to y.). Since A is a charge, if (E n ) is a disjoint 
sequence in X with union E, then 

f /*-2 f /*. 

Je 7l = l J E n 

We refer to this relation by saying that the indefinite integral of a function 
in L is countably additive. 

The next result is sometimes referred to as the property of absolute 
integr ability of the Lebesgue integral. The reader will recall that, 
although the absolute value of a (proper) Riemann integrable function 
is Riemann integrable, this may no longer be the case for a function 
which has an improper Riemann integral (for example, consider 
f{x) - x" 1 sin x on the infinite interval 1 < x < +oo). 
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5.3 Theorem. A measurable function f belongs to L if and only if 
|/| belongs to L. In this case 

(5.3) |J/4*|< Jl/14*. 

proof. By definition /belongs to L if and only if/ + and /~ belong 
to M + and have finite integrals. Since |/| + = |/| =/ + + f~ and 
| f\~ =0, the assertion follows from Lemma 4.5(a) and Corollary 
4.7(b). Moreover, 

iJ/d/i | = | j f + dp - Jf~ dp 

< J/ + dp + Jf~ dp = J |/| dp. Q.E.D. 

5.4 Corollary. Iff is measurable, g is integrable, and \ f\ ^ jg| , 
then f is integrable, and 

J* I/I 4* < J|g| dp. 

proof. This follows from Lemma 4.5(a) and Theorem 5.3. q.e.d. 

We shall now show that the integral is linear on the space L in the 
following sense. 

5.5 Theorem. A constant multiple af and a sum f + g of functions 
in L belongs to L and 

Jot/ dp. = a J fdp, j(f + g) dp. = J f dp + jg dp. 
proof. If a = 0, then af = 0 everywhere so that 
Jot/ dp = 0 = a J f dp. 

If a > 0, then (af ) + = a f + and (a/) - = a f~, whence 
jafdp = jaf + dp - jaf- dp 

= “ { J/ + d P ~ J/' = “ J 

The case a < 0 is handled similarly. 
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If / and g belong to L, then |/| and |g| belong to L. Since 
1/ + g| s? |/| + |g| it follows from Corollaries 4.7 and 5.4 that f + g 
belongs to L. To establish the desired relation, we observe that 

f + g = (f + + g + ) ~ if~ + g~). 

Since f + + g + and f~ + g~ are nonnegative integrable functions, it 
follows from the observation made after Definition 5.1 that 

/(/+ g)<k = J(/ + + g + )dn - J(/- + g~)dp. 

If we apply Corollary 4.7(b) and rearrange the terms, we obtain 

J(/+ g) dfx = j f + dn - j f~ dp + Jg + dix - jg~ dix 

= jfdp + jgdp. Q.e.d. 

We shall now establish the most important convergence theorem 
for integrable functions. 

5.6 Lebesgue Dominated Convergence Theorem. Let (/„) be a 
sequence of integrable functions which converges almost everywhere to a 
real-valued measurable function f. If there exists an integrable function 
g such that |/„| < g for all n, then f is integrable and 

(5.4) jfdp = lim jfndfc. 


proof. By redefining the functions /„,/ on a set of measure 0 we 
may assume that the convergence takes place on all of X. It follows 
from Corollary 5.4 that / is integrable. Since g + f n > 0, we can 
apply Fatou’s Lemma 4.8 and Theorem 5.5 to obtain 


jg^T + jfdn = jig +f)d>x ^ lim inf J (g + /„) d/i 
= lim inf ( Jg dp + J/„ dg^ 

= jg dy- + lim inf J /„ d^. 
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Therefore, it follows that 


(5.5) 


J f dp. ^ lim inf j f n dp. 


Since g — /„ > 0, another application of Fatou’s Lemma and Theorem 
5.5 yields 

J# dn - J fdfi = J(g - /) dp < lim inf J(g - /„) dp. 

^ jg dp- lim sup J /„ dp, 
from which it follows that 
(5.6) lim sup J/ n < J f dp. 

Combine’ (5.5) and (5.6) to infer that 

J f dp = lim j f n d/x. q.e.d. 


DEPENDENCE ON A PARAMETER 

Frequently one needs to consider integrals where the integrand 
depends on a real parameter. We shall show how the Lebesgue 
Dominated Convergence Theorem can be used in this connection. 

For the remainder of this chapter we shall let / denote a function 
defined on X x [a, ft] to R and shall assume that the function x — > 
f(x, t) is A-measurable for each t e [a, b\. Additional hypotheses will 
be stated explicitly. 

5.7 Corollary. Suppose that for some t 0 in [a, b ] 

(5.7) f(x, t 0 ) = lim f(x, t ) 

(-<0 

for each x e X, and that there exists an integrable function g on X such 
that | f(x, t)\ g(x). Then 

j fix, t 0 ) dix(x) = lim I f(x, t) dp(x). 

J t->t 0 J 
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proof . Let ( t n ) be a sequence in [a, b] which converges to t 0 , and 
apply the Dominated Convergence Theorem to the sequence ( /„ ) 
defined by f n {x) = f(x, t n ) for xeX. q . e . d . 

5.8 Corollary. If the function t f(x, t) is continuous on [ a,b ] 
for each x e X, and if there is an integrable function g on X such that 
| f(x, 0| < g(x), then the function F defined by 

( 5 - 8 ) F(t) = jf(x,t)di4x) 

is continuous for t in [a,b\. 

proof . This is an immediate consequence of Corollary 5.7. q . e . d . 

5.9 Corollary. Suppose that for some t 0 e [a, b\, the function 
x ->/(*, t 0 ) is integrable on X, that df/dt exists on X x [a,b], and that 
there exists an integrable function g on X such that 

t) < g(x). 

Then the function F defined in Corollary 5.8 is differentiable on [a, b ] and 

Tt (') = Jt / ^ X ’ Q d ^ x) = fdt (x> ^ d/dx ) ■ 

proof . Let t be any point of [ a, b ]. If ( t n ) is a sequence in [a, b] 
converging to t with t n t, then 


; ( x , t) = lim 


/(■*, t n ) ~ f(x, Q 


xe X. 


Therefore, the function a -> (8f/dt)(x, t) is measurable. 

If x e X and t e [a, b\, we can apply the Mean Value Theorem (see 
Reference [1], page 210) to infer the existence of a s x between t 0 and t 
such that 

f(x, t) - f{x, t 0 ) = {t - t 0 )j t (x, sf). 


Therefore we have 


I fix, 01 < | fix, t 0 )| + \t — t 0 1 g(x), 
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which shows that the function x — > f(x, t) is integrable for each t in 
[a,b]. Hence, if t n ^ t, then 

F(t n ) - F(t) = f f(x, Q - /(x, t) 
t n t J t n t 

Since this integrand is dominated by g(x), we may apply the Dominated 
Convergence Theorem to obtain the stated conclusion. q.e.d. 

5.10 Corollary. Under the hypotheses of Corollary 5.8, 

J F(t) dt = J |J /(x, t ) 4 t(x)j dt 

= J[£7(*,0<a] <H*), 

where the integrals with respect to t are Riemann integrals. 
proof. Recall that if <p is continuous on [a, b\ then 
d f* 

j t J 95 ( 5 ) ds = <p(t), a t < b. 

Let h be defined on X x [a, b] by 

h(x,t)= f f(x,s)ds. 

J a 

It follows that ( dh/dt)(x , t) = /(x, t). Since this Riemann integral 
exists, it is the limit of a sequence of Riemann sums ; hence the map 
x h(x, t ) is measurable for each t. Moreover, since |/(x, ?)| < g(x), 
we infer that \h(x, r)| ^ g(x)(b - a), so that the function x -> h(x, t) 
is integrable for each t e {a, b]. Let H be defined on [a, b ] by 

H (f) = J h(x, t ) dfi{x ) ; 
it follows from Corollary 5.9 that 


lit ^ = Jl? ^ = fft x ’ ^ d ^ = F ®' 
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Therefore we have 


P F(t) dt = H(b) - H(a) 

J a 

= J [h{x, b) - h(x, a)] d/4x) 

= J[J 

Q.E.D. 

The interchange of the order of (Lebesgue) integrals will be considered 
in Chapter 10. 

EXERCISES 

5. A. lffeL(X, X, /x) and a > 0, show that the set {xe X : |/(x)| > 
a) has finite measure. In addition, the set {x e X : f{x) A 0} has 
<r-finite measure (i.e., it is the union of a sequence of measurable sets 
with finite measure). 

5.B. If / is an X-measurable real-valued function and if f(x) = 0 for 
ft-almost all x in X, then / e L(X, X, /x) and 

jfdti = 0 . 

5.C. If/e L(X, X, 4) and g is an A-measurable real-valued function 
such that f(x) = g(x) almost everywhere on X, then g e L(X, X, g.) and 

j f dfl = j gdH " 

5.D. If / e L(X, X, g.) and e > 0, then there exists a measurable 
simple function <p such that 

j \f - <p\ dg- < s- 

5.E. If fsL and g is a bounded measurable function, then the 
product fg also belongs to L. 

5.F. If /belongs to L, then it does not follow that f 2 belongs to L. 
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5.G. Suppose that /is in L(X, X, /j.) and that its indefinite integral is 

HE) = J / dfj., EeX. 

Show that A(£) ^ 0 for all £ e A' if and only if f(x) ^ 0 for almost all 
x £ X. Moreover, A(£) = 0 for all E if and only if /(x) = 0 for almost 
all x e X. 

5.H. Suppose that f x and / 2 are in L(X, X, /z) and let A : and A 2 be 
their indefinite integrals. Show that A X (E) = A 2 (£) for all EeX if 
and only if //x) = / 2 (x) for almost all x in X. 

5.1. If / is a complex-valued function on X such that Re / and Im / 
belong to L(X, X, /z) , we say that / is integrable and define 

J / dfx = J Re / dfjt + / Jim/ dfx. 

Let / be a complex-valued measurable function. Show that / is 
integrable if and only if |/| is integrable, in which case 

|J/4z| < J|/| dfi. 

[Hint: If J/d/z = r e i0 with r, 6 real, consider g(x) = e~ ie f(x).] 

5.J. Let ( /„ ) be a sequence of complex-valued measurable functions 
which converges to /. If there exists an integrable function g such 
that |/ n | < g, show that 



5.K. Let X = N, let X be all subsets of N, and let /x be the counting 
measure on X. Show that / belongs to L(X, X, /z) if and only if the 
series 2/00 is absolutely convergent, in which case 

f /4* = 2 /(»)• 

J n = 1 

5.L. If (/„) is a sequence in L(X, X, p) which converges uniformly 
on J to a function /, and if fx(X) < +oo, then 
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5.M. Show that the conclusion in the Exercise 5.L may fail if the 
hypothesis n(X) < +oo is dropped. 

5.N. Let/ n = n xto.i/nii where X = R, X = B, and fi is Lebesgue 
measure. Show that the condition |/ n | < g cannot be dropped in the 
Lebesgue Dominated Convergence Theorem. 

5.0. If /„ e L(X, X, n), and if 

00 

n = 1 

then the series 2/ n (x) converges almost everywhere to a function / in 
L(X, X, n) . Moreover, 

5.P. Let f n e L(X, X, /j.), and suppose that (/„) converges to a 
function /. Show that if 

lim'J | /„ - f\ dp = 0, then J |/| dp = lim J|/ n | dp. 


jia 


dfj, < +oo. 


5.Q. If t > 0, then 


/. 


+ 00 1 
e~ tx dx = - 
o t 


Moreover, if t ^ a > 0, then e~ tx ^ e~ ax . Use this and Exercise 
4.M to justify differentiating under the integral sign and to obtain 
the formula 


r + co 

Jo 


x n e 


dx = n\ 


5.R. Suppose that / is defined on X x [a, b] to R and that the 
function x->f(t,x ) is A'-measurable for each te[a,b\. Suppose 
that for some t 0 and t l in [a, b ] the function x -> f{x, t 0 ) is integrable 
on X, that ( dfjdt)(x , tf) exists, and that there exists an integrable 
function g on X such that 


fix, t) - f{x, fi) 
t-h 


< g(x) 


for xe X, and t e [a, b ], t / t x . Then 


j t J fix, t ) dn(x) ^ = 'jif t ix, h) dft(x). 
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5.S. Suppose the function x^-f(x,t) is A'-measurable for each 
t e R, and the function t -> f(x, t) is continuous on R for each x e X. 
In addition, suppose that there are integrable functions g, h on X such 
that | f(x, t)\ ^ g(x) and such that the improper Riemann integral 

j* \f(x, 0| dt h(x). 

J — 00 

Show that 

j: [j ax, t) ^ = j [ j + * ax, t ) dt ] dvtx), 

where the integrals with respect to t are improper Riemann integrals. 

5.T. Let /be an AT-measurable function on X to R. For ne N, let 
(/ n ) be the sequence of truncates of / (see Exercise 2.K). If / is 
integrable with respect to fi, then 

J/4* = lim J*/ B dfx. 

sup f |/nl d^ < +CO, 




'■* (2 


Conversely, if 
then / is integrable. 
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CHAPTER 6 


The Lebesgue spaces L p 


It is often useful to impose the structure of a Banach space on the 
set of all integrable functions on a measure space ( X , X, /x) . In 
addition, we shall introduce the L P , 1 < p ^ oo, spaces which occur 
frequently in analysis. Aside from the intrinsic importance of these 
spaces, we examine them here partly to indicate applications of some 
of the results in the earlier sections. 

6. 1 Definition. If V is a real linear ( = vector) space, then a real- 
valued function N on V is said to be a norm for V in case it satisfies 

(i) N(v) > 0 for all v e V; 

(ii) N(v) = 0 if and only if v = 0; 

(iii) N(av ) = |«| A^(u) for all v e V and real a; 

(iv) N(u + v) ^ N(u) + N(v) for all n, v e V. 

If condition (ii) is dropped, the function N is said to be a semi-norm or a 
pseudo-norm for V. A normed linear space is a linear space V together 
with a norm for V. 

6.2 Examples, (a) The absolute value function yields a norm for 
the real numbers. 

(b) The linear space R n of n-tuples of real numbers can be normed 
by defining 

Ai( «j, . - . , h„) = |wi| + • • • 4- \u n | , 

A p («i, . . . , m„) = (|Uip + • • • + \u n \ p } llp , p > 1, 

Nao(Ul, ...,u n ) = sup (iMil , . . . , K|}. 
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It is easy to check that JV a and N m are norms and that N p satisfies (i), 
(ii), (iii). It is a consequence of Minkowski’s Inequality, which will 
be proved subsequently, that N p satisfies (iv). 

(c) The linear space / of all real-valued sequences u = («„) such 
that N t (u) = 2 |m„| < +oo is a normed linear space under N 1 . 
Similarly, if 1 < p < oo, the collection l p of all sequences such that 
N p (u) — {2 \u n \ p } llp < +co is normed by N„. 

(d) The collection B(X) of all bounded real-valued functions on X 
is normed by 

N(f) = sup{|/(x)| : xeX}. 

In particular, the linear space of continuous functions on X = [a, b\ is 
normed. 

All the preceding examples have been proper norms on a linear 
space. There are also semi-norms on a linear space that are of interest. 
The following are some examples. 

6.3 Examples, (a) On the space R n , consider the semi-norm 

JVo(Ki,...,w n ) = sup{|« 2 |,. . . , |w B |}. 

Here N 0 (u 1 , ...,«„) = 0 if and only if u 2 = • • • = u n = 0. 

(b) On the linear space C[0, 1] of continuous functions on [0, 1] to 
R, define the semi-norm 

N 0 (f) = sup{|/0)| : 0 sS x i). 

Here N 0 (f) = 0 if and only if f{x) vanishes for 0 ^ x ^ \ . 

(c) On the linear space of functions on [a, b ] to R which have con- 
tinuous derivatives, consider the semi-norm 

N 0 (f) = sup{|/'(x)| : a s£ x =* b}. 

Here N 0 (f ) = 0 if and only if /is constant on [a,b\. 

6.4 Definition. Let (X, X, p.) be a measure space. If /belongs to 
L(X, X, p), we define 

N u (f) = J |/| 

It will be shown that N u is a semi-norm on the space L(X, X, /n) . 
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6.5 Lemma. The space L(X, X, p) is a linear space under the 
operations defined by 

if + g) (X) = f(x) + g(x), (ccf)(x) = af{x), X E X , 

and N u is a semi-norm on L(X, X, p ) . Moreover, N u (f) = 0 if and 
only if f(x) = 0 for p-almost all x in X. 

proof. It was seen in Theorem 5.5 that L = L( X, X, p) is a linear 
space under the indicated operations. It is clear that N u (f) ^ 0 for 
/ sL, and that 

Nfytf) = J | “/I dp. = |<x| J |/| dp = M #„(/). 

Moreover, it follows from the Triangle Inequality that 

K(f+g) = j\f+g\dp<z J(|/| + \ g \)dp 

~ j\f\dd + j\g\dp = N u (f ) + N u (g). 

Hence is a semi-norm on L, and it follows from Corollary 4.10 that 
N„(f) = 0 if and only if f{x) = 0 for almost all x. q.e.d. 

In order to make L{X, X, p) into a normed linear space, we shall 
identify two functions that are equal almost everywhere; that is, we use 
equivalence classes of functions instead of functions. 

6.6 Definition. Two functions in L = L{X, X, p) are said to be 
/x-equivalent if they are equal /x-almost everywhere. The equivalence 
class determined by / in L is sometimes denoted by [/] and consists of 
the set of all functions in L which are /^-equivalent to /. The Lebesgue 
space Li = LfX, X, p) consists of all /^-equivalence classes in L. If 
[/] belongs to L 1 , we define its norm by 

(6.1) IIL/llli = Jl/I dp. 

6.7 Theorem. The Lebesgue space LfX, X, p) is a normed linear 
space. 
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proof. It is understood, of course, that the vector operations in L x 
are defined by 

«[/] = [«/], If] + [g] = If + gh 

and that the zero element of L 1 is [0]. We shall check only that 
equation (6.1) gives a norm on L 1 . Certainly |[/]]|i ^ 0 and ||[0]|| 1 = 
0. Moreover, if |[/]||i = 0 then 

Jl/l4* = o, 

so f{x) = 0 for /^-almost all *. Hence [/] = [0] . Finally, it is easily 
seen that properties (iii) and (iv) of Definition 6.1 are satisfied. There- 
fore || || ! yields a norm on L x . q.e.d. 

It should always be remembered that the elements of L x are actually 
equivalence classes of functions in L. However, it is both convenient 
and customary to regard these elements as being functions, and we 
shall subsequently do so. Thus we shall make reference to the 
equivalence class (/] by referring to “the element / of L x ,” and we shall 
write l/ll ! in place of ||[/]||i. 


THE SPACES L p , 1 < p< + oo 

We now wish to consider a family of related normed linear spaces of 
equivalence classes of measurable functions. 

6.8 Definition. If 1 ^ p < oo, the space L v = L P (X, X, p) con- 
sists of all /^-equivalence classes of ^-measurable real-valued functions 
/ for which |/| p has finite integral with respect to p over X . Two 
functions are /(.-equivalent if they are equal /(.-almost everywhere. We 
set 

(6-3) \\fh = {j\f\’dpy P . 

If p = 1 , this reduces to the norm introduced previously on the 
space L x of equivalence classes of integrable functions. We shall show 
subsequently that if 1 < p < oo, then L p is a normed linear space 
under (6.3), and is complete under this norm; thus L v is a Banach 
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space. It is understood that the vector operations between the 
equivalence classes in L p are defined pointwise: the sum of the equiv- 
alence classes containing / and g is the equivalence class containing 
f + g and similarly for the product cf. 

In the special case where p is the counting measure on all subsets of 
N, the Lp-spaces can be identified with the sequence spaces l p of 
Example 6.2(c). In this case, each equivalence class contains one 
element. It is frequently enlightening to interpret assertions about 
general L p -spaces by considering the somewhat simpler /^-spaces. 

In order to establish that (6.3) yields a norm on L p , we shall need 
the following basic inequality. 

6.9 Holder’s Inequality. Let feL p and ge L„ where p > 1 and 
(VP) + 0/?) = 1- Then fg e L 1 and \\fg\l, ^ ||/|| p ||g|| 9 . 

proof. Let a be a real number satisfying 0 < a < 1 , and consider 
the function <p defined for r ^ 0 by 

<p(t) = at — t a . 

It is easy to check that <p'(t) < 0 for 0 < t < 1 and <p'(t) > 0 for t > 1 . 
It follows from the Mean Value Theorem of calculus that <p(/) ^ ip(l) 
and that <p(t) = <p(l), if and only if t = 1 . Therefore we have 

t a < at + (1 - a), t Ss 0. 

If a, b are nonnegative, and if we let t = a/b and multiply by b, we 
obtain the inequality 

a a b 1 ~ a < aa + (1 — a)b, 

where equality holds if and only if a = b. 

Now let p and q satisfy 1 < p < co and (1/p) + (1 jq) = 1 and take 
a = 1/p. It follows that if A,B are any nonnegative real numbers, 
then 

(6.4) AB ^ — + — . 

p q 

and that the equality holds if and only if A p = B q . 

Suppose that feL p and geL q , and that ||/|| p f 0 and HgH, f 0. 
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The product fg is measurable and (6.4) with A = \f(x)\/\\f\\ p and 
B = kWI/lkL implies that 

j/WgWI . I/M|f . 

II/IIp IkL ^ p\\fh v wllsll/ 


Since both of the terms on the right are integrable, it follows from 
Corollary 5.4 and Theorem 5.5 that fg is integrable. Moreover, on 
integrating we obtain 


ll/glli 

11/IUkL 


P <7 


which is Holder’s Inequality. q.e.d. 

Holder’s Inequality implies that the product of a function in L p and 
a function in L q is integrable when p > 1 and q satisfies the relation 
(l// 7 ) + (1 A?) = 1 or, equivalently, when p + q — pq . Two numbers 
satisfying this relation are said to be conjugate indices. It will be noted 
that p = 2 is the only self-conjugate index. Thus the product of two 
functions in L 2 is integrable. 


6.10 Cauchy-BunyakovskiI-Schwarz Inequality. If f and g 
belong to L 2 , then fg is integrable and 

(6-5) |J7*«fr| < \\fg\dp ^ ||/|| 2 ||g|| 2 . 

6.11 Minkowski’s Inequality. If f and h belong to L v , p > 1, 
thenf + h belongs to L p and 

( 6 -6) «/+%< II/IIp + IHIp- 

proof. The case p = 1 has already been treated, so we suppose 
P > 1- The sum/ + h is evidently measurable. Since 

|/+ hf ^ [2 sup (|/| ,|A|}] P < 2*’{|/| p + \hf} 

it follows from Corollary 5.4 and Theorem 5.5 that/ + h e L v . More- 
over, 


(6.7) |/+ h\* = |/+ h\ |/+ h I*- 1 < |/| |/+ h \»~ 1 + \h\ \f+ hf~\ 
Since/ + heL p , then | / -f h\ p e L x ; since p = (p — \)q it follows that 
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I / + h\ p ~ 1 eL q . Hence we can apply Holder’s Inequality to infer that 

Jl/I \f+h\-'dn < ||/|| P {J|/+/ I | <p - 1) ‘'^} 1/5 
= I/I, 11/ + h\\/‘ q - 

If we treat the second term on the right in (6.7) similarly, we obtain 
11/ + hl P < |/||, 11/ + h\\/'« + ||A|| P |/+ All/'’ 

-{«/!*+ \m\\f+h\\ P »'. 

If A. — ||/+ A ||p = 0, then equation (6.6) is trivial. If A =£ 0, we 
can divide the above inequality by A p/ ’; since p - p/q = 1 , we obtain 
Minkowski’s Inequality. q.e.d. 

It is readily seen that the space L p is a linear space and that formula 
(6.3) defines a norm on L v . The only nontrivial thing to be checked 
here is the inequality 6.1(iv) and this is Minkowski’s Inequality. We 
shall now show that L p is complete under this norm in the following 
sense. 

6.12 Definition. A sequence (/„) in L p is a Cauchy sequence in 
L v if for every positive number e there exists an M(z) such that if 
m,n > M(e), then \\f m — f n \ p < c. A sequence (/„) in L p is con- 
vergent to/ in L v if for every positive number e there exists an N(z) 
such that if n ^ N(t), then ||/ — f n \ p < e. A normed linear space is 
complete if every Cauchy sequence converges to some element of the 
space. 

6.13 Lemma. If the sequence (/„) converges to f in L p , then it is a 
Cauchy sequence. 

proof. If m, n is N(s/ 2), then 

!/-/-«,< 5, «/-/»«, <5. 

Hence we have 

||/m fn [|p < ll/m ~ /||p + ||/-/„||p < e. Q.E.D. 

We shall now show that every Cauchy sequence in L p converges in 
L p to an element. This result is sometimes called the Riesz-Fischer 
Theorem. 
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6.14 Completeness Theorem. If l < p < oo , then the space L p is a 
complete normed linear space under the norm 

II/IIp = d pf' V ' 

proof. It has been stated that L v is a normed linear space. To 
establish the completeness of L v , let (/„) be a Cauchy sequence relative 
to the norm || || p . Hence, if e > 0 there exists an M(s) such that if 
m, n ^ M(e), then 

(6.8) J I f m - A| p dp = II /. - All/ < s*. 

There exists a subsequence (g k ) of (/ n ) such that ||* fc+1 - g k || P < 2~ fc 
for k e N. Define g by 

(6.9) g(x) = |gj(x)| 4- 2 Iffc+iW - gk( x ')\, 

k = 1 

so that g is in M + (X, X). By Fatou’s Lemma, we have 

J |*|' dp < lim inf J||* a | + J] |* t+1 - * fc | j dp. 

Take the pth root of both sides and apply Minkowski’s Inequality to 
obtain 

{J l#l P 4*} < Hm inf 'Ifo + i - &| p | 

^ ll.gillp + 1 • 

Hence, if E = {xeX: g(x) < +oo}, then EeX and p(X\E) = 0. 
Therefore, the series in (6.9) converges almost everywhere and g xe 
belongs to L v . 

We now define / on X by 

oo 

fix) = Sx(x) + 2 {fo+iW - gk(x)}, xeE, 

*Tjj k = l 

» =0 > X $ E - 

^ nce ^ Iffil + X^=i lffi+i - 9j\ f g and since (g k ) converges 
almost everywhere to /, the Dominated Convergence Theorem 5.6 
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implies that feL p . Since \f — g k \ v < 2 p g p , we infer from the 
Dominated Convergence Theorem that 0 = lim \\f — g k || p , so that 
(g k ) converges in L p to /. 

In view of (6.8), if m ^ M(e) and k is sufficiently large, then 
Jl/m ~ gk\ p dp < e p . 

Apply Fatou’s Lemma to conclude that 

f I/m - f\ v dp. < lim inf f \f m - g k \ p dp. ^ e p , 

J k~* oo J 

whenever m ^ M( e). This proves that the sequence (f n ) converges to 
/ in the norm of L p . q.e.d. 

A complete normed linear space is usually called a Banach space 
Thus the preceding theorem could be formulated: the space L p is a 
Banach space under the norm given in (6.3). 


THE SPACE Loo 

We shall now introduce a space which is related to the L p -spaces. 

6.15 Definition. The space L„ = L„(A, X, if) consists of all the 
equivalence classes of A-measurable real-valued functions which are 
almost everywhere bounded, two functions being equivalent when they 
are equal ^-almost everywhere. If feL and NeX with p{N) = 0, 
we define 

S(N) = sup{j/(x)| : x$l V} 

and 

(6. 1 0) ll/ll „o = inf {S(N) : N e X, p(N) = 0} . 

An element of L„ is called an essentially bounded function. 

It follows (see Exercise 6.T) that if feL x , then |/(x)| ^ ||/||„ for 
almost all x. Moreover, if A < ||/||„o, then there exists a set E with 
positive measure such that |/(x)| ^ A for xe E. It is also clear that 
the norm in (6.10) is well-defined on L „ . 
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6.16 Theorem. The space L x is a complete normed linear space 
under the norm given by formula (6.10). 

proof. It is clear that L m is a linear space and that ||/||oo > 0, 
||0|U = 0, and ||«/||„ = |a| ||/||«,. If ||/||„ = 0, then there exists a 
set N k e X with p(N k ) = 0 such that |/(x)| < l/k for x £ AT*. If we put 
N = Ufc-i^k. then NeX, p(N) = 0, and \f(x)\ = 0 for x$N. 
Therefore, /(x) = 0 for almost all x. 

If f,gsL x , there exist sets N x , N 2 in X with ^(Afj) = p(N 2 ) = 0 
such that 

|/(*)l < 1/1 » for x$N lt 

U(x)| < for x ^ N 2 . 

Therefore |/(x) + g(x)| < ||/||a, + |g|U for x £ (A^ u N 2 ) , from 
which it follows that ||/+ g|j„ < ||/||„ + ]jg|» • 

It remains to prove that L x is complete. Let ( f n ) be a Cauchy 
sequence in L x , and let M be a set in X with p(M) = 0, such that 
|/n(x)| < [| fn || ao for X £ M, n = 1 , 2, . . . , and also such that |/ n (x) - 

/m(x)| < Wfn -/mil « for all x $ M, n, /w = 1,2, Then the 

sequence (/„) is uniformly convergent on X\M, and we let 

/(x) = lim/ n (x), x$M, 

= 0, xe M. 

It follows that /is measurable, and it is easily seen that \\f n — /||» -> 0. 
Hence L„ is complete. Q.E.D. 


EXERCISES 

6. A. Let C[0, 1] be the linear space of continuous functions on 
[0, 1] to R. Define N 0 for / in C[0, 1] by N 0 (f) = |/(0)| . Show that 
N 0 is a semi-norm on C[0, 1]. 

6.B. Let C[0, 1] be as before and define A/ for /in C[0, 1] to be the 
Riemann integral of |/| over [0, 1], Show that N\ defines a norm on 
C[0, 1], If /„ is defined for n > 1 to be equal to 0 for 0 < x 
(1 — l/n)/2, to be equal to 1 for \ < x < 1 , and to be linear for 
(1 — l/n)/2 < x ^ i, show that (/„) is a Cauchy sequence, but that it 
does not converge relative to A/ to an element of C[0, 1). 


I 
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6.C. Let TV be a norm on a linear space V and let d be defined for 
u,ve V by d{u, v) = N(u — v ). Show that d is a metric on V; that 
is, (i) d(u, v) ^ 0 for all u, v e V; (ii) d(u, v) = 0 if and only if u = v; 
(iii) d(u, v ) = d(v, u); (iv) d(u, v) < d(u, w) + d(w, v). 

6.D. If f e LfX, X, p.) and s > 0, then there exists a simple X- 
measurable function <p such that \\f - < s. Extend this to L p , 

1 < p < oo . Is this true for ? 

6.E. If feL p , 1 ^ p < co, and if E — {xe X : |/(x)| # 0}, then 
E is CT-finite. 

6.F. If feL p and if E n = {xe X : \f(x)\ > n), then fx{E n ) -> 0 as 
n — oo. 

6.G. Let X = N, and let fi be the counting measure on N. If / is 
defined on N by /(n) = 1/m, then / does not belong to L lt but it does 
belong to L p for 1 < p ^ oo. [Alternatively, let X = R, X = B, and 
let be Lebesgue measure and define g(x) = 0 for x < 1 and g(x) = 
1/x for x > 1 .] 

6.H. Let X = N, and let A be the measure on N which has measure 
l/« 2 at the point n. (More precisely A(L) = 2 {1 /n 2 : ne £}.) Show 
that A(L') < +oo. Let /be defined on AT by /(«) = Vn. Show that 
/e L p if and only if 1 < p < 2. [For a similar example, let X = (0, 1) 
with Lebesgue measure, and consider g(x) = 1/Vx.] 

6.1. Modify the Exercise 6.H to obtain a function on a finite measure 
space which belongs to L p if and only if 1 ^ p <po. 

6.J. Let {X, X, /x) be a finite measure space. If / is A-measurable, 
let E n = {x e X : (n — 1) < |/(x)[ < n}. Show that feL x if and 
only if 

oo 

2 np{E n ) < +co. 

n = 1 

More generally,/ e Lp for 1 < p < oo, if and only if 

00 

2 n p p(E n ) < + 00 . 

n = 1 

6.K. If {X, X, ft) is a finite measure space and feL p , then f e L r 
for 1 r ^ p. (Hint: Use Exercise 6.J or the inequality |/| r ^ 1 + 



The Lebesgue Spaces L v 63 

\f\ v .) Apply Holder’s Inequality to |/| r in L Plr and g = 1 to obtain 
the inequality 

ll/ll, < IJ/IIp/W 

where s = (1/r) — (1 /p). Therefore, if p(X) = 1, then ||/|| r ^ ||/|| p . 

6.L. Suppose that X = N and p is the counting measure on N. If 
/eL„, then feL s with 1 ^ p ^ s < oo, and |]/|| s ^ ||/|| p . 

« 6.M. Let X = (0,oo), let p be Lebesgue measure on X, and let 
f(x) = x~ V2 {\ + |logx|) _1 . Then feL p if and only if p = 2. 

6.N. Let (X, X, p) be any measure space and let / belong to both 
L Pl and L P2 , with 1 < Pi < p 2 < °° . Prove that fe L p for any value 
of p such that p x < p < p 2 - 

6.0. Let 1 < p < oo, and let (l/p) + (l/q) = 1 . It follows from 
Holder’s Inequality that iifeL p , then 

|J7s^| < Wfl 

for all geL q such that \\g\\ q < 1 . If/ # 0, define g 0 on X by g Q (x) = 
c[signum /(x)]|/(x)| p-1 , where c = (|/||p)“ p;<! . Show that g 0 eL q , 
that || go II « = 1» and that 

|J7*o4*| = ii/iu. 

6.P. Let / e L P (X, X, p), 1 ^ p < oo, and let e > 0 . jShow that 
there exists a set E s eX with p(E e ) < +oo such that if Fe X and 
FnE s = 0, then ||/xr||,. < e. 

6.Q. Let f n e L P (X, X, p) , 1 < p < oo , and let /3„ be defined for 
EeX by 

Pn(E) = {j E \fn\ P dpy P . 

Show that | fi n (E) - p m (E)\ < ||/ n - f m \\ p . Hence, if (/ n ) is a Cauchy 
sequence in L p , then lim flJJE) exists for each EeX. 

6.R. Let f n , f3 n be as in Exercise 6.Q. If (/ n ) is a Cauchy sequence 
and s > 0, then there exists a set E^eX with p(E c ) < +co such that if 
Fe X and F n E t = 0 , then /?„(F) < £ for all ne N. 
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6.S. Let /„, be as in the Exercise 6.R, and suppose that (/„) is a 
Cauchy sequence. If e > 0, then there exists a 8(e) > 0 such that if 
EeX and fi(E) < S(s), then j3 n (E) < s for all neN. (Hint: Use 
Corollary 4. 1 1 .) 

6.T. lffeL x (X, X, fi), then |/(x)j < l/l^ for almost all x:. More- 
over, if A < l/ll then there exists a set Ee X with ^(E) > 0 such 
that \f(x)\ > A for all * e E. 

6.U. If/eZ, p , 1 < p < oo, and g e £«,, then the product./# eL p and 

mu < i/Likiu. 

6.V. The space L„( X, X, ff) is contained in L,(X, X, if and only 
if n(X) < oo. If f/(X) = 1 and /eL m , then 

ll/IU = lim ll/flp. 

p~* 00 



CHAPTER 7 


Modes of Convergence 


We have already had occasion to mention four types of convergence 
of a sequence of measurable functions : pointwise convergence, almost 
everywhere convergence, uniform convergence, and convergence in 
L p . There are two other notions of convergence that are of importance 
in dealing with measurable functions. We shall introduce these in this 
chapter and give interrelations between the various modes. 

For convenience, we shall restate the definitions. In this chapter we 
shall consider only real-valued functions defined on a fixed measure space 
(X, X, n) . In some applications it is necessary to consider extended 
real-valued functions, but this can usually be done by modifying the 
present discussion. In addition we shall limit our attention to L p for 
1 ^ p < oo, since the convergence L„ requires a special examination 
which is usually quite direct. Thus it will be understood that p is 
limited to these values. 

The sequence (/ n ) converges uniformly to / if for every c > 0 there 
exists a natural number N(z) such that if n ^ N(e) and xe X, then 
1/nO) -/(*) | < s. 

The sequence (/„) converges pointwise to / if for every s > 0 and 
xe X there is a natural number N(s, x), such that if n > N(t,x), then 
1/nO) - fix ) I < s. 

The sequence (/„) converges almost everywhere to / if there exists a 
set M in X with p(M) = 0 such that for every e > 0 and xe X\M 
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there exists a natural number N{ s, x), such that if n > N( s, x), then 
lAW -/(*)| < e. 

It is obvious that uniform convergence implies pointwise convergence, 
that pointwise convergence implies almost everywhere convergence, and 
it is easily seen that the reverse implications do not hold. (Of course, 
if X consists of only a finite number of points, then pointwise con- 
vergence implies uniform convergence; if the only set with measure 
zero is the empty set, then almost everywhere convergence implies 
pointwise convergence.) 


CONVERGENCE IN L p 

We now recall the notion of convergence in L p , which was introduced 
in Chapter 6. We remark that an element in L p is an equivalence class 
of functions which are real-valued and whose pth. powers are integrable. 
However, by exercising some caution, we may regard an element of L v 
as being a real-valued measurable function. 

A sequence (/„) in L p = L P (X, X, p) converges in L p to /e L p , if for 
every e > 0 there exists a natural number N(z) such that if n > A/e), 
then 

||/n-/||p = {Ji/„-/| P ^} 1,P < e . 

In this case, we sometimes say that the sequence (/„) converges to /in 
mean (of order p) . 

A sequence (/„) in L p is said to be Cauchy in L p , if for every s > 0 
there exists a natural number N( s) such that if m,n Ss A/s), then 

II fm -Up = {Jl/ m -/n| P <^} 1/P < e. 

We have seen in Theorem 6.14 that if (/„) is Cauchy in L p , then there 
exists an/e L p such that (/„) converges in L p to /. 

The relationship between convergence in L v and the other modes of 
convergence that we have introduced is not so close. It is possible 
(see Exercise 7. A) for a sequence (/„) in L p to converge uniformly on X 
(and therefore pointwise and almost everywhere) to a function /in L p , 
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but not converge in L P . However, if p(X) < +oo, this cannot be 
the case. 

7.1 Theorem. Suppose that p(X) < +oo and that (/„) is a sequence 
in L p which converges uniformly on X to f. Then f belongs to L p and the 
sequence (/„) converges in L p to f. 

Proof. Let e > 0 and let N(s) be such that |/„(x) — f{x)\ < e 
whenever n ^ N(e) and x e X. If n ^ N(&), then 

(7. 1) ||/ n - /lp = { J \f n (x) - f(x)Y d,X' V 

< {Js p ^} 1,P = ep(X) VP , 

so that (/„) converges in L p to /. q.e.d. 

It is possible (see Exercise 7.B) for a sequence (/„) in L p to converge 
pointwise (and therefore almost everywhere) to a function /in L p , but 
not converge in L p even when p{X) < +oo . However, if the sequence 
is dominated by a function in L v , then the L p convergence does take 
place. 

7.2 Theorem. Let (/„) be a sequence in L p which converges almost 
everywhere to a measurable function f. If there exists a g in L p such that 

( 7 - 2 ) l/n(*)| g(x), xsX, ns N, 

thenf belongs to L p and (/„) converges in L p to f. 

proof. In view of inequality (7.2), it follows that \f{x)\ ^ g(x) 
almost everywhere. Since g e L p , it follows from Corollary 5.4 that 
feL v . Now 

l/n(x) - f{x)\ p [2 g(x)] p , a.e., 

and since lim |/ n (x) - f(x)\ v = 0, a.e., and 2 p g p belongs to L lt it 
follows from the Lebesgue Dominated Convergence Theorem 5.6 that 

lim J | f n ~f\ p dp = 0. 

Therefore (/ n ) converges in L p to /. 


Q.E.D. 
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7.3 Corollary. Let p(X) < +co, and let (/„) be a sequence in L p 
which converges almost everywhere to a measurable function f. If there 
exists a constant K such that 

(7.3) |/„(x)| < K, xeX, neN, 

then f belongs to L p and (/„) converges in L p to f. 

proof. If fc(X) < +oo, the constant functions belong to L p . q.e.d. 

It might be suspected that convergence in L p implies almost every- 
where convergence, but this is not the case. In fact, we shall give an 
example of sequence (/ n ) which converges in L p to a function /, but 
such that (f n (x)) does not converge to /(x) for any x in X(l) 

7.4 Example. Let A" = [0,1], X = B, and let A be Lebesgue 
measure. We shall consider the intervals [0, 1], [0,+], [4 , 1], [0,41, 
[h tl, [h i], [0, i], [4, 4], [4, 41, [4, l], [0, |], [i, f], . . . 

Let f n be the characteristic function of the nth interval on this list and 

let / be identically zero. If n ^ m{m + l)/2(= 1 + 2 H 1 - m), 

then f n is a characteristic function of an interval whose measure is at 
most 1 jm . Hence 

Wfn ~ fU = \\fn ~ f\* d\ 

= jf n dh 1/m. 

Therefore (/„) converges in L„ to /. However, if x is any point of 
[0, 1], then the sequence (/„(x)) has a subsequence consisting only of 
l’s and another subsequence consisting only of 0’s. Therefore, the 
sequence (/„) does not converge at any point of [0, 1], (It may be 
observed, however, that one can select a subsequence of (/„) which 
converges to /.) 

CONVERGENCE IN MEASURE 

Although convergence in L p does not imply almost everywhere 
convergence, it does imply another type of convergence that is often of 
interest. 
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7.5 Definition. A sequence (/„) of measurable real-valued functions 
is said to converge in measure to a measurable real-valued function / 
in case 

(7.4) lim fi({x e X : \f n (x) - f(x)\ > «}) = 0 

rt“* CO 

for each a > 0. The sequence (/„) is said to be Cauchy in measure 
in case 

(7.5) lim /x({x e X : | f m (x) - f n (x ) | > «}) = 0 

m.n-* co 

for each a > 0. 

If {f n ) converges uniformly to /, then the set 

{xe X : \f n (x) - f(x)\ > a} 

is empty for sufficiently large n . Hence, uniform convergence implies 
convergence in measure. It is not difficult to show (see Exercise 7.D) 
that pointwise convergence (and therefore almost everywhere con- 
vergence) need not imply convergence in measure, unless the space X 
has finite measure (see Theorem 7.12). We observe, however, that 
convergence in L p does imply convergence in measure. Indeed if 
E n (a) = {xe X : \f n (x) - f{x)\ > a}, then 

f l/n — f\ v dfl ^ f I /„ - f\ p d\L > O? fc(£ n (cc)) . 

J JE n (a) 

Since a > 0, it follows that ||/ n — /j| p 0 implies that a))->0 
as n oo . 

The reader can readily verify that Example 7.4 also shows that a 
sequence can converge in measure to a function but not converge at 
any point. Despite that fact, we shall now prove a result due to 
F. Riesz that implies that if a sequence (/„) converges in measure to /, 
then some subsequence converges almost everywhere to /. Actually 
we shall prove somewhat more than that. 

7.6 Theorem. Let (/„) be a sequence of measurable real-valued 
functions which is Cauchy in measure. Then there is a subsequence 
which converges almost everywhere and in measure to a measurable 
real-valued function f. 
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proof. Select a subsequence ( g k ) of (/ n ) such that the set E k = 
{xeX: |g fc+1 (x) - g k (x)| > 2~ k } is such that fx(E k ) < 2~ k . Let 
F k = U"=fc E i so that F k eX and /x(F k ) < 2- {k ~ l \ If i ^ j > k and 
x$F k , then 

(7.6) |g,M - gj(x)\ < |&(jc) - *_!(*)! + • ■ • + \g j + 1 (x) - g y (x)| 

< J_ , J_ 1 

" 2 i ~ 1 + " ' + 2 i < 2 ,_1 ' 

Let .F = p)”=i so that F e X and n(F) — 0. From the argument 
just given it follows that (g y ) converges on J\f. If we define/ by 

f(x) = lim gj(x) , x $ F, 

= 0, xeF, 

then ( g ,) converges almost everywhere to the measurable real-valued 
function /. Passing to the limit in (7.6) as i oo , we infer that if 
j > k and x f F k , then 


I/M - &MI ^ 2^ < 2^ • 

This shows that the sequence (g y ) converges uniformly to / on the 
complement of each set F k . 

To see that (g y ) converges in measure to /, let a, e be positive real 
numbers and choose k so large that tx{F k ) < 2“ ( ' c " 1) < inf (a, s). If 
j > k, the above estimate shows that 

{* e X : |/M - gfx ) | ^}c{xel: |/(x) - g/x)| > 2'< fc - 1 >} 

£ F k . 

Therefore, fx.({x e X : \f{x) - g/x)| ^ a}) ^ /x(F k ) < s for all j > k, 
so that (g y ) converges in measure to /. q.e.d. 

7.7 Corollary. Let (/„) be a sequence of measurable real-valued 
functions which is Cauchy in measure. Then there is a measurable real- 
valued function f to which the sequence converges in measure. This 
limit function f is uniquely determined almost everywhere. 
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proof. We have seen that there is a subsequence (/ Bfc ) which con- 
verges in measure to a function /. To see that the entire sequence 
converges in measure to /, observe that since 

I/M - /.Ml c I/M -/„* Ml + 1AM - /.Ml, 

it follows that 

{xeX: I/O) - /„0)| > a} c <jx e A" : |/0) - /n fc O)l > 

U j* e * : l/»*M - /»MI > 

The convergence in measure of (/„) to / follows from this relation. 

Suppose that the sequence (/„) converges in measure to both / and g. 
Since 

I/O) - £0)1 < I/O) - /.Ml + 1/nM - £M|, 

it follows that 

{xeX : |/0) - £0)1 > a} z^xeX: |/0) - /„0)l > 

U e JT : |/nO) - £0)1 > 

so that 

/,(0 e T : |/ 0 ) - £ 0)1 » *}) = 0 

for all a > 0. Taking a = 1/n, n e iV, we infer that f = g, a.e. q.e.d. 

It has been remarked that convergence in L p implies convergence in 
measure. In general, convergence in measure does not imply con- 
vergence in L v (see Exercise 7.E). However, this implication does hold 
when the convergence is dominated. 

7.8 Theorem. Let (/„) be a sequence of functions in L r which con- 
verges in measure to f and let gsL„ be such that 

l/.MI < £0). a.e. 

Then f e L v and (/„) converges in L„ to f. 

proof If (/„) does not converge in L v to /, there exist a subsequence 
(gk) of ( f n ) and an s > 0 such that 

( 7 - 7 ) «£*-/«, >« 


\ 


for k e N. 
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Since (g k ) is a subsequence of (/„), it follows (see Exercise 7.G) that it 
converges in measure to /. By Theorem 7.6 there is a subsequence 
(h r ) of (g k ) which converges almost everywhere and in measure to a 
function h. From the uniqueness part of Corollary 7.7 it follows that 
ft — f a.e. Since (h r ) converges almost everywhere to / and is dominated 
by g, Theorem 7.2 implies that \\h r - f\\ p 0. However, this 
contradicts the relation (7.7). q.e.d. 

ALMOST UNIFORM CONVERGENCE 

In the proof of Theorem 7.6 we constructed a sequence (g,) of 
measurable real-valued functions which was uniformly convergent on 
the complement of sets which have arbitrarily small measure. At first 
mention this sounds equivalent to uniform convergence outside a set of 
zero measure, but it is not equivalent (see Exercise 7.J). 

7.9 Definition. A sequence (f n ) of measurable functions is said to 
be almost uniformly convergent to a measurable function / if for each 
3 > 0 there is a set E 6 in X with /j.(E 6 ) < 3 such that (/„) converges 
uniformly to / on X\E 6 . The sequence (/„) is said to be an almost 
uniformly Cauchy sequence if for every 8 > 0 there exists a set E d in X 
with fJt(E d ) < 3 such that (/„) is uniformly convergent on X\ E d . 

The reader is warned that the terminology (in addition to being 
unpleasant) is slightly at variance with the earlier use of the modifier 
“almost.” It is clear that almost uniform convergence is implied by 
uniform convergence, but it is not hard to see that almost uniform 
convergence does not imply this stronger mode. 

7.10 Lemma. Let (/ n ) be an almost uniformly Cauchy sequence. 
Then there exists a measurable function f such that (/„) converges almost 
uniformly and almost everywhere to f. 

proof. If k e N, let E k eX be such that p(E k ) < 2~ k and (/„) is 
uniformly convergent on X\E k . Let F k = IJ/U Ej> so that F k e X 
and n(F k ) < 2' ( ' c_1) . Note that (/„) converges uniformly on X \ F k s 
X \ E k and define g k by 

g k (x) = \imf n (x), x ¥ F k , 

= 0, xeF k . 
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We observe that the sequence ( F k ) is decreasing and that if F = C\F k , 
then Fe X and = 0. If h < k, then g h (x) = g k (x) for all x<£Fh- 
Therefore, the sequence (g k ) converges on all of I to a measurable 
limit function which we shall denote by /. If x <$ F k , then fix) - 
g k (x) = lim f n (x) . It follows that (/„) converges to / on X \ F, so that 
(/„) converges to / almost everywhere on X. 

To see that the convergence is almost uniform, let s > 0, and let K 
be so large that 2~ {K ~ Y) < t. Then g(F K ) < e, and (/ n ) converges 
uniformly to g K = / on X \ F K . q.e.d. 

The next result relates convergence in measure and almost uniform 
convergence. 

7. 1 1 Theorem. If a sequence (/ n ) converges almost uniformly to f, 
then it converges in measure. Conversely, if a sequence (h n ) converges in 
measure to h, then some subsequence converges almost uniformly to h. 

proof. Suppose that (/ n ) converges almost uniformly to /, and let 
a and e be positive numbers. Then there exists a set E e in X with 
< e such that (/„) converges to / uniformly on X\E t . There- 
fore, if n is sufficiently large, then the set {x e X : \f n (x) - f(x)\ > a} 
must be contained in E t . This shows that (/„) converges in measure 
to/. 

Conversely, suppose that ( h n ) converges in measure to h. It follows 
from Theorem 7.6 that there is a subsequence (g k ) of (h n ) which con- 
verges in measure to a function g and the proof of Theorem 7.6 actually 
shows that the convergence is almost uniform. Since ( g k ) converges 
in measure to both h and g, it follows from Corollary 7.7 that h — g a.e. 
Therefore the subsequence ( g k ) of (h n ) converges almost uniformly to h . 

Q.E.D. 

It follows from the Theorem 7.11 that if a sequence converges in L p , 
then it has a subsequence which converges almost uniformly. Con- 
versely, it may be seen (see Exercise 7.K) that almost uniform con- 
vergence does not imply convergence in L v in general, although it does 
if the convergence is dominated by a function in L v (apply Theorem 7.8). 

One of the consequences of Lemma 7.10 is that almost uniform 
convergence implies almost everywhere convergence. In general, the 
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converse is false (see Exercise 7.L). However, it is a remarkable and 
important fact that if the functions are real-valued and if fx(X) < +oo, 
then almost everywhere convergence does imply almost uniform 
convergence. 

7.12 Egoroff’s Theorem. Suppose that p(X) < +oo and that (/„) 
is a sequence of measurable real-valued functions which converges almost 
everywhere on X to a measurable real-valued function f. Then the 
sequence (/„) converges almost uniformly and in measure to f. 

proof. We suppose without loss of generality that (/„) converges at 
every point of X to /. If m , n e N, let 

Efm) = 0 \xeX : \f k (x) - f(x)\ 

k = n { m) 

so that E n (m) belongs to X and E n + 1 (m ) £ E n {m). Since /„(x) fix) 
for all x e X, it follows that 

H En (w) = 0. 

n = 1 

Since p(X) < +oo, we infer that ji{E n {m)) 0 as n +co . If 
8 > 0, let k m be such that fx{E km im)) < 8/2 m and let E s =Um=i E km (m), 
so that E d e X and /x(£ 4 ) < S . Observe that if x $ E d , then x $ E k Jjn) , 
so that 

I AW -/(*)| < i 

for all k ^ k m . Therefore (A) is uniformly convergent on the 
complement of E 6 . q.e.d. 

It is convenient to have a table indicating the relations between the 
various modes of convergence we have been discussing. Modifying 
the idea in Reference [10], we present three diagrams relating almost 
everywhere convergence (denoted by AE), almost uniform con- 
vergence (denoted by AU), convergence in L p (denoted by L v ), and 
convergence in measure (denoted by M ). It is understood that in 
discussing L v convergence, it is assumed that the functions belong to 
L p . Diagram 7.1 pertains to the case of a general measure space. A 



Modes of Convergence 75 



Diagram 7.1 General case 

solid arrow signifies implication; a dashed arrow signifies that a sub- 
sequence converges in the indicated mode. The absence of an arrow 
indicates that a counterexample can be constructed. Diagram 7.2 
relates to the case of a finite measure space. In view of Egoroff’s 

AE — * All 



Theorem two implications are added. In Diagram 7.3, we assume 
that the sequence (/„) is dominated by a function g in L p . Here three 
implications are added. 

We leave it as an exercise to verify that all the implications indicated 
in these diagrams hold, and that no other ones are valid without 
additional hypotheses. 



Diagram 7.3 Dominated convergence 
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We conclude this chapter with a set of necessary and sufficient 
conditions for L p convergence. The reader will observe that the 
second and third conditions are automatically fulfilled when the 
sequence is dominated by a function in L p . 


7.13 Vitali Convergence Theorem. Let ( /„ ) be a sequence in 
L P (X, X, p) , 1 ^ p < oo . Then the following three conditions are 
necessary and sufficient for the L p convergence of (/„) to f : 

(i) (fn) converges to f in measure. 

(ii) For each e > 0 there is a set E t eX with p(E e ) < +oo such that 
r i. » v iffeX and Fn E e = 0, hen 

pr - -y-, 

I , i 1 f/nl ? dp < t 9 for all neN. 

jfe. 




a | 



(iii) For each e > 0 tAere is a S(e) > 0, such that if Ee X and 
p(E) < 8(e) , then 

^e!" J l/ni P dp < e p for all neN. 

proof. It was seen after Definition 7.5 that L p convergence implies 
convergence in measure. The fact that L p convergence of the (/„) 
implies (ii) and (iii) is not difficult and is left to the reader (see Exercises 
6.R and 6.S). 

We shall now show that these three conditions imply that (/„) 
converges in L v to /. If e > 0, let E t be as in (ii) and let F = X\E e 
If the Minkowski Inequality is applied to /„-/„ = (/„ - f m )x Bt + 
fn Xf -fmXF, we obtain 


Wfn-fm 


V 



■b 2e 


for n, m e N. Now let a = s[p(E t ) ] “ Vp and let H nm = {x e E t : | f n (x) 
- /mW| > a). In view of (i), there exists a K{z) such that if n,m J; 
K(z), then p(H nm ) < 8(s). Another application of the Minkowski 
Inequality together with (iii), gives 


f \fn ~ fm\* dp)"’ \fn~f m \’dp 

JE, J UE E \H„ m 

+ {j i/ n | p ^y ,p +{j \f m \ ^ dp 

U».. J <JH nm 


y: up 


< a[p(E c )] llp + s + s = 3s, 
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when n, m ^ A/s). On combining this with the earlier inequality, 
we infer that the sequence (/„) is Cauchy and hence convergent in L p . 
Since we already know that (/„) is convergent in measure to /, it 
follows from the uniqueness in Corollary 7.7 that (/„) converges to 

Q.E.D. 


EXERCISES 

In these exercises (R, B, A) denotes the real line with Lebesgue 
measure defined on the Borel subsets of R. Moreover, 1 < oo. 

7. A. Let /„ = n~ llP xto,m- Show that the sequence (/„) converges 
uniformly to the 0-function, but that it does not converge in L P (R , B, A) . 

7.B. Let f n = n xti/ n . 2 /n]- Show that the sequence (/„) converges 
everywhere to the 0-function but that it does not converge in L P (R ,B,X). 

7.C. Show that both of the sequences in Exercises 7.A and 7.B 
converge in measure to their limits. 

7.D. Let f n = X[n,„+ u • Show that the sequence (/„) converges 
everywhere to the 0-function, but that it does not converge in measure. 

7.E. The sequence in 7.B shows that convergence in measure does 
not imply L p -convergence, even for a finite measure space. 

7.F. Write down a subsequence of the sequence in Example 7.4 
which converges almost everywhere to the 0-function. Can you find 
one which converges everywhere ? 

7.G. If a sequence (/„) converges in measure to a function /, then 
every subsequence of (/„) converges in measure to /. More generally, 
if (fn) is Cauchy in measure, then every subsequence is Cauchy in 
measure. 

7.H. If a sequence (f n ) converges in L p to a function /, and a sub- 
sequence of (/„) converges in L v to g, then f - g a.e. 

7.1. If (f n ) is a sequence of characteristic functions of sets in X, and 
if (fn) converges to /in L p , show that /is (almost everywhere equal to) 
the characteristic function of a set in X. 

7.J. Show that the sequence (/„) in Exercise 7.B has the property 
that if 8 > 0, then it is uniformly convergent on the complement of 
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the set [0,8]. However, show that there does not exist a set of measure 
zero, on the complement of which (/ n ) is uniformly convergent. 

7.K. Show that the sequence in Exercise 7.B converges almost 
uniformly but not in L p . 

7.L. Show that the sequence in Exercise 7.D converges everywhere, 
but not almost uniformly. 

;; 7 -M. Let / n = » jfco.ii- Show that the hypothesis that the limit 

function be finite (at least almost everywhere) cannot be dropped in 
Egoroff’s Theorem. co rA/tits- 

»- 4 f * 

■1,^ imT 7-N. Show that Fatou’s -lemma holds if almost everywhere con- 

- 1 vergence is replaced by convergence in measure. 

7.0. Show that the Lebesgue Dominated Convergence Theorem 
holds if almost everywhere convergence is replaced by convergence in 
measure. 

7.P. If g e L p and |/ n | < g, show that conditions (ii) and (iii) of the 
Vitali Convergence Theorem 7.13 are satisfied. 

7.Q. Let ( X , X, ,u) be a finite measure space. If /is an X-measurable 
function, let 

r(f) = S~ HTf^- 

Show that a sequence (/,) of J¥-measurable functions converges in 
measure to / if and only if r(f n - /) -* 0 . 

7.R. If the sequence (/„) of measurable functions converges almost 
everywhere to a measurable function / and <p is continuous on It to If, 
then the sequence (<p ° / n ) converges almost everywhere to <p °f. Con- 
versely, if <p is not continuous at every point, then there exists a sequence 
(/„) which converges almost everywhere to /but such that (95 ° /„) does 
not converge almost everywhere to <p of. 

7.S. If <p is uniformly continuous on If to It, and if (/„) converges 
uniformly (respectively, almost uniformly, in measure) to /, then 
(9 °fn) converges uniformly (respectively, almost uniformly, in measure) 
to rp °f. Conversely, if <p is not uniformly continuous, there exists a 
measure space and a sequence (/„) converging uniformly (and hence 
almost uniformly and in measure) to / but such that (<p ° f n ) does not 
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converge in measure (and hence not uniformly or almost uniformly) 

to <p of. 

7.T. Let (A', X, /*) be a finite measure space and let 1 < p < oo . 
Let <p be continuous on R to R and satisfy the condition : (*) there exists 
K > 0 such that |<p(r)| < LT|r| for |r| ^ K. Show that <p ° /belongs to 
L v for each feL p . Conversely, if <p does not satisfy (*) , then there is a 
function /in L p on a finite measure space such that <p°f does not belong 
to L p . 

7.U. If (/„) converges to /in L p on a finite measure space, and if <p 
is continuous and satisfies condition (*) of Exercise 7.T, then (<p of n ) 
converges in L p to <p°f. Conversely, if condition (*) is not satisfied, 
there exists a finite measure space and a sequence (/„) which converges 
in Lp to /but such that ( <p ° /„) does not converge in L p to <p ° /. 

7.V. Let (X, X, fx) be an arbitrary measure space. Let <p be con- 
tinuous on R to R and satisfy: (**) there exists K ^ 0 such that 
MOI < for all t e R. If/e L p , then <p o/belongs to L p . Con- 
versely, if <p does not satisy (**),* there exists a measure space and a 
function /e L p such that <p ° / does not belong to L p . 

7.W. If (/ n ) converges to /in L p on an arbitrary measure space, and 
if <p is continuous and satisfies (**), then (<p ° / n ) converges to <p o/in L p . 
Conversely, if <p does not satisfy (**), there exists a measure space and a 
sequence (/,) which converges in L p to /, but such that (<p ° /„) does not 
converge in L p to <p ° /. 



CHAPTER 8 


Decomposition of Measures 


In this chapter we shall consider the possibility of decomposing 
measures and charges in various ways and shall obtain some very 
useful results. First we shall consider charges and show that a charge 
can be written as the difference of two finite measures. 

We recall from Definition 3.6 that a charge on a measurable space 
{X, X ) is a real-valued function A defined on the a-algebra X such that 
A(0) = 0 and which is countably additive in the sense that 

HQ,*) - f KEJ 

for any disjoint sequence (E n ) of sets irt X. The reader can easily check 
the proofs of Lemmas 3.3 and 3.4 to show that if (E n ) is an increasing 
sequence of sets in X, then 

(8.1) A( Q E n \ = lim A (£„), 

and if ( F n ) is a decreasing sequence of sets in X, then 

(8-2) A(n F„) = limA(F n ). 

8. 1 Definition. If A is a charge on X, then a set P in X is said to be 
positive with respect to A if A (E n P) > 0 for any Em X. A set N in 
X is said to be negative with respect to A if A (E n N) < 0 for any E in 
X. A set M in X is said to be a null set for A if A(£ n M) = 0 for 
any E in X. 


80 
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It is an exercise to show that a measurable subset of a positive set is 
positive and that the union of two positive sets is a positive set. 

8.2 Hahn Decomposition Theorem. If X is a charge on X , then 
there exist sets P and N in X with X = P u N, P n N = 0, and such 
that P is positive and N is negative with respect to A . 

proof. The class P of all positive sets is not empty since it must 
contain 0, at least. Let a = sup {X(A) : A e P}, let (A n ) be a sequence 
in P such that lim A (A n ) = a, and let P = U“=i A n . Since the union 
of two positive sets is positive, the sequence ( A n ) can be chosen to be 
monotone increasing, and we shall assume that this has been done. 
Clearly P is a positive set for A, since 

A (E nP) = \(e n (j A^ = a( Q (E n A n j) = lim A (E n A n ) > 0. 

Moreover, a = lim X(A n ) = A (P) < oo. 

We shall now show that the set TV = X \ P is a negative set. If not, 
there is a measurable subset E of N with A (E) > 0. The set E cannot 
be a positive set, for then P u E would be a positive set with \(P u E) > 
a , contrary to the definition of a . Hence E contains sets with negative 
charge; let n x be the smallest natural number such that E contains a set 
E x in X , such that A(£j) 4 - l/n 1 . Now 

A(£\^) = A (£) - A(£j) > A (E) > 0; 

however, E \ E l cannot be a positive set, for then P t = P u (E \ Ef 
would be a positive set with XiPf > a. Therefore E \ E 1 contains 
sets with negative charge. Let n 2 be the smallest natural number such 
that E. \ E x contains a set E 2 in X such that A (£ 2 ) ^ - 1 fn 2 . As before 
E \ (Ex v E 2 ) is not a positive set, and we let n 3 be the smallest natural 
number such that E \ (Ex u E z ) contains a set E 3 in X such that 
X(E 3 ) < — 1/« 3 - Repeating this argument, we obtain a disjoint 
sequence (E k ) of sets of X such that \{E k ) ^ - 1 /n k . Let F = (J“ =1 E k 
so that 

A (F) = J X(E k ) < - fi^O, 

k = l fc = l n k 

which shows that l/n k ^0. If G is a measurable subset of E\F and 
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\{G) < 0, then A (G) < - \/(n k - 1) for sufficiently large k, con- 
tradicting the fact that n k is the smallest natural number such that 
E \ (E 1 u • • • u E k ) contains a set with charge less than —l/n k . Hence, 
every measurable subset G of E \ F must have A (G) ^ 0, so that E \ F 
is a positive set for A. Since A (E\F) = A (E) - A (F) > 0, we infer 
that P u (E\ F) is a positive set with charge exceeding a, which is a 
contradiction. 

Therefore, it follows that the set N = X\P is a negative set for A, 
and the desired decomposition of X is obtained. q.e.d. 

A pair P, N of measurable sets satisfying the conclusions of the 
preceding theorem is said to form a Hahn decomposition of X with 
respect to A . In general, there will be no unique Hahn decomposition. 
In fact, if P, N is a Hahn decomposition for A, and if M is a null set 
for A, then P u M, N\M and P\ M, N u M are also Hahn decom- 
positions for A. This lack of uniqueness is not an important matter 
for most purposes, however. 

8.3 Lemma. If P 1 ,N 1 and P 2 , N 2 are Hahn decompositions for A, 
and E belongs to X, then 

A(£' n P x ) = A (E n P 2 ), A (E n Nf) = A (E n N 2 ) . 

proof. Since E n (P 1 \ P 2 ) is contained in the positive set P x and in 
the negative set N 2 , then A (£ n (P x \ T 2 )) = 0 so that 

KE n Pf) = \(E n P x n P 2 ). 

Similarly, 

A(£ n P 2 ) = A(£nPjnP 2 ), 
from which it follows that 

A (E n P x ) = A(£ n P 2 ) . q.e.d. 

8.4 Definition. Let A be a charge on X and let P, N be a Hahn 
decomposition for A. The positive and the negative variations of A are 
the finite measures A + , A" defined for £ in A by 


(8.3) 


A + (E) = A (E n P), A ~(E) = - X(E n N) . 
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The total variation of A is the measure |A| defined for E in A' by 
\\\(E) = \+(E) + \~(E). 

It is a consequence of Lemma 8.3 that the positive and negative 
variations are well-defined and do not depend on the Hahn decomposi- 
tion. It is also clear that 

(8.4) A(£) = A (E n P) + A {E n N) = A + (£) - \~(E). 

We shall state this result formally. 

8.5 Jordan Decomposition Theorem. If X is a charge on X, it is 
the difference of two finite measures on X. In particular, A is the 
difference of A + and A - . Moreover, if A = p — v where p, v are 
finite measures on X, then 

(8.5) p(E) > A + (£), v(E) > A -(E) 
for all E in X. 

proof. The representation A = A + — A - has already been estab- 
lished. Since p and v have nonnegative values, then 

A + (E) = A (E nP) = p(E n P) - v(E n P) 

< p(E n P) jg p(E) . 

Similarly, A ~(E) < v{E) for any E in X. q.e.d. 

We have seen, in Lemma 5.2, that if a function / is integrable with 
respect to a measure p on X, and if A is defined for E in X by 

(8-6) A (E) = f / dp, 

Je 

then A is a charge. We now identify the positive and negative 
variations of A, 

8.6 Theorem. Iff belongs to L(X, X, p), and A is defined by equation 

(8.6) , then A + , A - , and |A| are given for E in X by 

XffE) = f / + dp, A -(E) = f f~ dp, 

Je Je 

|A|(20 = f |/| dp. 

Je 
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proof. Let P f = {xeX: f(x) > 0} and N f = { xeX: f(x) < 0}. 
Then X = P { u N f and P f n N f - 0 . If E e X, then it is clear that 
\(E n P f ) ^ 0 and A(£ n N,) ^ 0 . Hence P f , N f is a Hahn decom- 
position for A. The statement now follows. q.e.d. 

It was seen in Corollary 4.9 that if / is a nonnegative extended real- 
valued measurable function and y is a measure on X, then the function 
A defined by equation (8.6) is a measure on X. There is a very 
important converse to this which gives conditions under which one 
can express a measure A as an integral with respect to y of a non- 
negative extended real-valued measurable function. It was seen in 
Corollary 4.11 that a necessary condition for this representation is 
that A(£) = 0 for any set £ in A" for which y(E) = 0. It turns out 
this condition is also sufficient in the important case where A and y are 
cr-finite. 

8.7 Definition. A measure A on X is said to be absolutely continuous 
with respect to a measure y on X if £ e X and /x(£) = 0 imply that 
A(£) = 0. In this case we write A « /x. A charge A is absolutely 
continuous with respect to a charge y in case the total variation |A| of 
A is absolutely continuous with respect to |/x| . 

The following lemma is useful and adds to our intuitive under- 
standing of absolute continuity. 

8.8 Lemma. Let A and y be finite measures on X. Then A « y if 
and only if for every z > 0 there exists a S(e) > 0 such that Ee X and 
y(E) < S(e) imply that A (£) < z . 

proof. If this condition is satisfied and y{E) = 0, then A(£) < e 
for all s > 0, from which it follows that A(£) = 0. 

Conversely, suppose that there exist an e > 0 and sets E n eX with 
y{E n ) < 2~ n and A(£ n ) ^ s. Let £„ = (J” =n E k , so that y(F n ) < 
2 _n + 1 and A(£ n ) ^ s. Since (£„) is a decreasing sequence of measurable 
sets, 

4 fl ^n) = lim y(F n ) = 0, 

a( n Fn) = lim A(£ n ) ^ e. 

Hence A is not absolutely continuous with respect to y. q.e.d. 
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8.9 Radon-Nikodym Theorem. Let A and p be a-finite measures 
defined on X and suppose that A is absolutely continuous with respect to p . 
Then there exists a function f in M + {X, A”) such that 

(8.6) A (E)= [ f dp, EeX. 

Je 

Moreover, the function f is uniquely determined p-almost everywhere. 

proof. We shall first prove Theorem 8.9 under the hypothesis that 
A(A') and p(X) are finite. 

If c > 0, let P(c), N(c ) be a Hahn decomposition of X for the charge 
A — cp. If k e N, consider the measurable sets 

A 1 = N(c), A k+1 = N((k + l)c)\ U A } . 

It is clear that the sets A k , Ice N, are disjoint and that 

U N(jc) = JA,. 

It follows that 

A k = N(kc)\\J N(Jc ) = N{kc) n "f] P(Jc). 

j=i j = i 

Hence if £ is a measurable subset of A k , then E £ N{kc) and E £ 
P((k - l)c) so that 

(8.7) ( k - l)cp(E) s: A (E) ^ kcp{E). 

Define B by 

B = X\\J A, = n P(jc), 

i=l i = 1 

so that B s P(kc) for all ke N. This implies that 
0 < kcp{B) ^ A (B) ^ \(X) < +oo 

for all k e N, so that p(B) = 0 . Since A « p, we infer that A (B) = 0 . 

Let f c be defined by f c (x) = (k — l)c for xe A k and f c (x) = 0 for 
xe B. If £ is an arbitrary measurable set, then E is the union of the 
disjoint sets E n B, E n A k , k e N , so it follows from (8.7) that 

f fi dp < ME) < f (/ c + c) dp ^ f f c dp + cp(X) . 

v E v E J E 
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We employ the preceding construction for c = 2~ n , n e N, to obtain 
a sequence of functions we now denote by /„ . Hence 


(8.8) f f n dix < A (E) ^ f /.<#. + 2 ->(Z). 

Je Je 

for all n e TV. Let m ^ n, and observe that 

f fndfi < A(£) < f f m d/x + 2“ m /x(Z), 

Je Je 

f fm dp < A(£) < f f n d(i + 2~ n p{X), 

Je Je 

from which it is seen that 



^ 2->(Z), 


for all E in X. If we let E be the sets where the integrand is positive 
and negative and combine, we deduce that 


Jl/n-/J4^2-" +1 KA0 

whenever rn ^ n. Thus (/„) converges in mean and in measure to a 
function /. Since the f n belong to M + , it is clear from Theorem 7.6 
that we may require that / e M + . Moreover, 


If fndfl - f fdp I I" \f n -/| 4* < fl/n - /I dp, 

\ Je Je \ Je J 

so that we conclude from (8.8) that 

A(£) = lim f /„ dp = f f dp 

Je Je 

for all EeX. This completes the proof of the existence assertion of 
the theorem in the case where both A and p are finite measures. 


We claim that / is uniquely determined up to sets of ju,-measure zero. 
Indeed, suppose that f, he M + and that 
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for all E in X. Let E x = {x : f(x) > h(x)} and E 2 = {x : f(x) < h(x )} , 
and apply Corollary 4.10 to infer that f(x) — h(x) /x-almost everywhere. 

We shall now suppose that A and p are a-finite and let (X n ) be an 
increasing sequence of sets in X such that 

A(X n ) < oo, p(X n ) < oo. 

Apply the preceding argument to obtain a function h n in M + which 
vanishes for x$ X n , such that if E is a measurable subset of X n , then 

A(£) = f h n dp. 

Je 

If n <: m, then X n <=, X m , and it follows that 

\ K dp = \ h m dp 
Je Je 

for any measurable subset E of X n . From the uniqueness of h n , it 
follows that h m (x) = h n (x) for /^-almost all x in X n whenever m ^ n. 
Let /„ = sup {h lt . . . , h n } so that (/ n ) is a monotone increasing sequence 
in M + and let / = lim/ n . If ise X, then 

A (EnX n ) = j E f n dp. 

Since ( E n X n ) is an increasing sequence of sets with union E, it follows 
from Lemma 3.3 and the Monotone Convergence Theorem 4.6 that 

ME) = lim A (E n X n ) = lim f f n dp 

JE 

The /x-uniqueness of /is established as before. q.e.d. 

The function / whose existence we have established is often called 
the Radon-Nikodym derivative of A with respect to p, and is denoted by 
dX/dp. It will be seen in the exercises to have properties closely related 
to the derivative. The reader should observe that this function is not 
necessarily integrable; in fact, f is (/-equivalent to) an integrable 
function if and only if A is a finite measure. 
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In intuitive terms, a measure A is absolutely continuous with respect 
to a measure p in case sets which have small ^-measure also have small 
A-measure. At the opposite extreme, there is the notion of singular 
measures, which we now introduce. 

8.10 Definition. Two measures A, p on X are said to be mutually 
singular if there are disjoint sets A, B in A" such that X = A u B and 
A(/l) = p(B) = 0. In this case we write A X M- 

Although the relation of singularity is symmetric in A and p, we shall 
sometimes say that A is singular with respect to p . 

8.11 Lebesgue Decomposition Theorem. Let A and p he o-finite 
measures defined on a o-algebra X. Then there exists a measure A x 
which is singular with respect to p and a measure A 2 which is absolutely 
continuous with respect to p such that A = A x + A 2 . Moreover, the 
measures A! and X 2 are unique. 

proof. Let v = A + p so that v is a a-finite measure. Since A and p 
are both absolutely continuous with respect to v, the Radon-Nikodym 
Theorem implies that there exist functions /, g in M + (X, X ) such that 

A {E)=\fdv, p{E) = ( g dv 

Je Je 

for all E in X. Let A = {x : g(x) = 0}, and let B = {x : g(x) > 0}, 
so that A n B = 0, and X = A u B. 

Define Aj and A 2 for E in AT by 

A X {E) = A (£ n A), A 2 (£) = A(£ n B). 

Since p(A) — 0, it follows that X P- To see that A 2 « p, observe 
that if p(E) = 0, then 

j^gdv = 0 , 

so that g(x) = 0 for v-almost all x in E. Hence v{E n B) = 0; since 
A « v, 

A 2 (E) = A(£ n B) = 0. 

Clearly A = Aj + A 2 , so the existence of this decomposition is affirmed. 

To establish the uniqueness of the decomposition, use the observation 
that if a is a measure such that a « p, and a P, then a = 0. q.e.d. 
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RIESZ REPRESENTATION THEOREM 

As another application of the Radon-Nikodym Theorem, we shall 
present theorems concerning the representation of bounded linear 
functionals on the spaces L p , 1 ^ p < oo . 

8.12 Definition. A linear functional on L p = L P (X, X, f) is a 
mapping G of L p into R such that 

G(af+ bg ) = aG(f) + bG(g) 

for all a, b in R and /, g in L p . The linear functional G is bounded if 
there exists a constant M such that 

\G(f)\ < M\\f\\ p 

for all/in L p . In this case, the bound or the norm of G is defined to be 

(8.9) ||G|| = sup{|G(/)| :feL v ,\\f\\ p ^ 1}. 

It is a consequence of the linearity of the integral and Holder’s 
Inequality that if g e L q (where q = co when p = 1 and q — p/(p — 1) 
otherwise) and if we define G on L p by 

( 8 - 10 ) G(f) = jfg dp, 

then G is a linear functional with norm at most equal to ||g||,(and it is 
an exercise to prove that ||G|| = ||g|,). The Riesz Theorem yields a 
converse to this observation. 

Before we prove this theorem it is convenient to observe that any 
bounded linear functional on L p can be written as the difference of two 
positive linear functionals (that is, functionals G such that G(f) ^ 0 
for all/ e L v for which / ^ 0). 

8.13 Lemma. Let G be a bounded, linear functional on L„. Then 
there exist two positive bounded linear functionals G + ,G~ such that 
G (f) = G+ (/) ~ G~(f) for all feL p . 

proof. If f> 0 define G + (f ) = sup{G(g) : geL P , 0 ^ g </}. 
It is clear that G + (cf) = c G + (f) for c > 0 and / 2* 0. If 0 < g, < /, 
then 

G(gi) + G(g 2 ) = G( gl + g2 ) s; G + (f + f 2 ). 
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Taking the suprema over all such g } in L p we obtain G + (f 1 ) + G + (f 2 ) < 
G + (f i + / 2 ) . Conversely, if 0 < h < f x + f 2 , let g x = sup (h - f 2 , 0) 
and g 2 = inf ( h,f 2 ). It follows thatg x + g 2 = h and thatO < g, ^ f-. 
Therefore G(h) = G(gf) + G(g 2 ) < G + (f 1 ) + G + (f 2 ); since this holds 
for all such heL p , we infer that 

G + U i +/ 2 ) = G + (/i) + G + (/ 2 ) 

for all fj in L„ such that / y > 0. 

If / is an arbitrary element of L p , define 

G + (f) = G+(/ + ) - G + (f~). 

It is an elementary exercise to show that G + is a bounded linear 
functional on L p . Further, we define G~ for f e L p by 

G-(f) = GHf) - G(f), 

so that G~ is evidently a bounded linear functional. From the defini- 
tion of G + it is readily seen that G~ is a positive linear functional, and 
it is obvious that G = G + - G~ . q.e.d. 

8.14 Riesz Representation Theorem. If (X, X, g) is a a-finite 
measure space and G is a bounded linear functional on LfX, X, /x) , then 
there exists a g in (X, X, g) such that equation (8.10) holds for all f in 
L 1 . Moreover, ||G|| = [[ g[[ m andg > 0 if G is a positive linear functional. 

proof. We shall first suppose that p(X) < oo and that G is positive. 
Define A on X to R by A (E) = G(x £ ); clearly A(0) = 0. If (E n ) is an 
increasing sequence in X and E = \J E n , then (xE n ) converges pointwise 
toxB- Since p(X) < oo, it follows from Corollary 7.3 that this sequence 
converges in L x to xe • Since 

0 < ME) - A (E n ) = G(xe) - G(xeJ 

— G( Xe — Xe „ ) ^ ||G|| II Xb — XE n II 1 J 

it follows that A is a measure. Moreover, if MeX and g(M) = 0, 
then A(M) = 0, so that A « p. 

On applying the Radon-Nikodym Theorem we obtain a nonnegative 
measurable function on X to R such that 

G(xe) = ME) = jxeg dp 
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for all E € X. It follows by linearity that 

G(<p) = jtpgdfi 

for all X-measurable simple functions 93 . 

If/is a nonnegative function in L 1 , let (<p n ) be a monotone increasing 
sequence of simple functions converging almost everywhere and in L x 
to /. From the boundedness of G it is seen that G(f) = lim G(<p n ) . 
Moreover, it follows from the Monotone Convergence Theorem that 

G(f) = lim J<p n g dp. - jfg dp. 

This relation holds for arbitrary f e by linearity. 

We now turn to the o-finite case. If X = (J F n , where (F n ) is an 
increasing sequence of sets in X with finite measure, the preceding 
argument yields the existence of nonnegative functions g n such that 

G (/Xr„) = J fXf n gn dp 

for all/in L\. If m ^ n it is readily seen that g m (x) = g n (x) for almost 
all x in F m . In this way we obtain a function g which represents G. 

If G is an arbitrary bounded linear functional on L 1 , Lemma 8.13 
shows that we can write G = G + - G~, where G + and G~ are bounded 
positive linear functionals. If we apply the preceding considerations to 
G + and G~ , we obtain nonnegative measurable functions g + , g~ 
which represent G + ,G~. If we set g = g + - g~, we obtain the 
representation 

(8-10) G(f) = jfgdp 

for all/eLx. It will be left as an exercise to show that ||(J|| = || ^ [| » . 

Q.E.D. 

8.15 Riesz Representation Theorem. If (X, X, p) is an arbitrary 
measure space and G is a bounded linear functional on L p (X,X,p), 
1 < p < 00 , then there exists a g in L„{X, X, p), where q = p/(p - 1 ), 
such that equation (8.10) holds for all fin L p . Moreover, ||G|| = ||g||,. 
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proof. If fx { X ) < oo, the proof of the preceding theorem requires 
only minor changes to show that there exists a g in L q with ||G|| = ||g||, 
and such that 

G(/) = jfgd,, 

for all / in L p . In addition, the procedure used before applies to 
extend the result to the case where ( X , X, / u.) is cr-finite. 

We now complete the proof by observing that a bounded linear 
functional “vanishes off of a ^-finite set.” More precisely, let (/„) be 
a sequence in L p such that j|/ n || = 1 and 

<?(/„) > |G|l(l - J)- 

There exists a cr-finite set X 0 in X outside of which all the /„ vanish. 
Let Ee X with E n X 0 = 0,then ||/„ ifxElIp < (1 +t p fi(E)) 1 / p for 
t > 0, when pi(E) < oo. Moreover, since 


G(f n ) + G(±t XE ) ^ I G(f n ± t XE )\, 


it follows that 


|G(tt*)l < ||G|||(1 + ?V(^)) 1/P - (l - 
for all n in N. First let « -> oo, and then divide by t > 0, to get 

< | G |0±M1^L. 


If we apply L’Hospital’s Rule as t -> 0+ , we infer that G{ Xe ) = 0, for 
any E € X, f(E) < oo, outside of the cr-finite set A/. Therefore if / 
is any function in L p such that X 0 n {x e X : f(x) + 0} = 0, it follows 
that Gif) = 0. 

Hence we can apply the preceding argument to obtain a function 
g on X 0 which represents G, and extend g to all of X by requiring that 
it vanish on the complement of X 0 . In this way we obtain the desired 
function. q.e.d. 
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EXERCISES 

8. A. If P is a positive set with respect to a charge A, and if Ee X 
and E s P, then E is positive with respect to A. 

8.B. If P x and P 2 are positive sets for a charge A, then P x u P 2 is 
positive for A. 

8.C. A set M in X is a null set for a charge A if and only if | A|(M) = 0 . 
8.D. If A is a charge on X, then the values of A are bounded and 

A + (£) = sup {A(F) : F £ E, Fe X}, 

A -(E) = -inf{A(F) : F £ E, FeX}. 

8.E. Let nx, y 2 , and y 3 be measures on (X, X). Show that y x « y 3 
and that y x « y 2 and y 2 « y 3 imply that y x « y 3 . Give an example 
to show that y x « /x 2 does not imply that p . 2 « . 

8.F. If (fi n ) is a sequence of measures on (X, X) with p. n (X) ^ 1 , let 
A be defined for E in X by 

A (E) = J 2 ">„(£). 

n = 1 

Show that A is a measure and that p. n « A for all n . 

8.G. Let A be a charge and let y. be a measure on ( X , X). If A « y, 
then A + , A", and |A| are absolutely continuous with respect to y. 

8.H. Show that Lemma 8.8 is true even if y is allowed to be an 
infinite measure. However, it may fail if A is an infinite measure. 
[Hint: Let A be the counting measure on N, and let 

y(E) = ^ 2-".] 

nsE 

8.1. Let y be defined as in Exercise 8.H and if E c N, let A be 
defined by 

A (E) = 0, if E = 0; 

= +oo, if E 7^0. 

Show that y is a finite measure on the a-algebra X of all subsets of N, 
and that A is an infinite measure on X. Moreover, A « y and y « A. 

8.J. If A and y are cr- finite and A « y, then the function / in the 
Radon-Nikodym Theorem can be taken to be finite-valued on X. 
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8.K. Let fx be a finite measure, let A « n, and let P n , N n be a Hahn 
decomposition for A - ni x. Let P = f| A., N = IJ N n . Show that 
N is cr-finite for A and that if E £ P, Ee X, then either A (£) = 0or 
X(E) = +oo. 

8.L. Use Exercise 8.K to extend the Radon-Nikodym Theorem to the 
case where ft is cr-finite and A is an arbitrary measure with A « n . Here 
/ is not necessarily finite-valued. 

8.M. (a) Let X be an uncountable set and X be the family of all 
subsets E of X such that either £or X \ E is countable. Let n(E) equal 
the number of elements in E if E is finite and equal +oo otherwise, and 
let A (E) = 0 if £ is countable and equal +co if E is uncountable. Then 
A « fx, but the Radon-Nikodym Theorem fails. 

(b) Let X = [0, 1] and let X be the Borel subsets of X. If fx is the 
counting measure on X and A is Lebesgue measure on X, then A is a 
finite measure and A « fx, but the Radon-Nikodym Theorem fails. 

8.N. Let X, fx be cr-finite measures on (X, X), let A « /z , and let 
/ = dX/dfx. If g belongs to M + (X, X), then 

jgdX = jgfdix. 


{Hint: First consider simple functions and apply the Monotone 
Convergence Theorem.) 

8.0. Let A, fx, v be cr-finite measures on {X, X). Use Exercise 8.N 
to show that if v « A and A « fx, then 


dv dv dX , 

dix = dA d/I ’ / x " a ™ os f everywhere. 

Also, if A ; « fx for j = 1,2, then 

d , , , , dX-\ dXn . 

■j- (A t + \ 2 ) = — + — , ^-almost everywhere. 

8.P. If A and /x are cr-finite, A « fx, and fx « X, then 


dX = 1 

dfx dfx/dX 


almost everywhere. 


8.Q. If A and fx are measures, with X « /x and A J _ ix, then A = 0. 
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8.R. If A is a charge and p is a measure, then |A| _L p implies that 
A+ and A - are singular with respect to p. 

8.S. The collection of all charges on (X, X) is a Banach space under 
the vector operations 

(cp)(E) = cp(E), (A + p)(E) = X(E) + p(E) 
and the norm \\p\\ = |/a|(Af). 

8.T. Suppose g satisfies equation (8.10) for all /in Li and that c > 1 . 
Let E c = {x : |g(x)| ^ c||G|}, and define f c (x) to be ±1 when 
± g(x) > c || (7 1| and to be 0 when x f E c . Then 

c \\G\\p(E c ) < G(f c ) < \\G\\p(E c ), 

which is a contradiction unless p(E c ) = 0. Infer that |g(x)| < ||G|| 
for /a-almost all x. 

8.U. If g satisfies (8.10) for all feL v , show that geL q and that 
M = \\g\\ q . 

8.V. The Riesz Representation Theorem for p = 2 can be proved by 
some elementary Hilbert space geometry (see [5], pp. 249-50). We now 
show that this result can be used to prove the Radon-Nikodym 
Theorem. We shall limit our attention to finite measures A, p with 
A « p. Let v = A + p and show that 

G(f) = jfdX 

defines a positive linear functional on L 2 (X, X, v) with positive norm. 
If g e L 2 (X, X, v) is such that 

G(f) = jfgd v , feL 2 (X,X,v), 

then we see by taking / = xe, Ee X, that 0 ^ g(x) < 1 for v-almost 
all x. Moreover, p{x : g(x) = 1} = 0. Since v = A + p, we have 

Jfc(l - g)d\ = jhgdp 

for all nonnegative h e L 2 (X, X, v) and hence for all nonnegative 
measurable h. Now take h = xsli 1 - g) to infer that 



CHAPTER 9 


Generation of Measures 


In the preceding chapters we have given a few examples of measures, 
but they are of a rather special form, and it is time to demonstrate how 
measures can be constructed. In particular, we wish to show how to 
construct Lebesgue measure on the real line R from the length of an 
interval. 

It is natural to define the length of the half-open interval (a, b ] to be 
the real number b - a and the length of the sets (-00, 6] = {x e R : 
x «; b}, and (a, +00) = {xeR : a < x}, and (-00, +00) to be the 
extended real number +00. We define the length of the union of a 
finite number of disjoint sets of these forms to be the sum of the 
corresponding lengths. Thus, the length of 

U (aj,bj] is 2 ” a,) 

5 ’- 1 i-l 

provided the intervals do not intersect. 

At first glance one might think that we have defined a measure on 
the family F of all sets which are finite unions of sets of the form 
(9.1) (a,b], (~co,b], (a, + co), (-00, +co). 

However, this is not the case since the countable union of sets in' F is 
not necessarily in F, so that F is not a a-algebra in the sense of Definition 
2.1. 

9.1 Definition. A family A of subsets of a set X is said to be an 

algebra or a field in case : 


96 
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(i) 0, ^(belong to A. 

(ii) If E oblongs to A , then its complement X \ E also belongs to A . 

(iii) If E 1 ,. . ,,E n belong to A, then their union IJ? =1 E } also 
belongs to A . 

It is convenient to define the notion of a measure on an algebra. 
In doing so, we require the set function to be countably additive over 
sequences whose union belongs to the algebra. 

9.2 Definition. If A is an algebra of subsets of a set X, then a 
measure on A is an extended real-valued function p defined on A such 
that (i) ;u.(0) = 0, (ii) p(E) > 0 for all Ee A , and (iii) if (E n ) is any 
disjoint sequence of sets in A such that (J" = i E n belongs to A , then 

M E n ) = | p(E n ). 

It seems reasonably clear, but not entirely obvious, that length gives 
a measure. We now prove this fact. 

9.3 Lemma. The collection F of all finite unions of sets of the form 

(9.1) is an algebra of subsets of R and length is a measure on F. 

proof. It is readily seen that F is an algebra. If / denotes the length 
function, then conditions 9.2(i) and (ii) are trivial. To prove (iii) it is 
enough to show that if one of the sets of the form (9.1) is the union of a 
countable collection of sets of this form, then the length adds up 
correctly. We shall treat an interval of the form (a, b], leaving the 
other possibilities as exercises. Suppose, then, that 

(9.2) (a, b] = (j (a s , b,], 

7 = 1 

where the intervals (a jy b,] are disjoint. Let (a lt hj, . . . , (a n , 6 n ] be 
any finite collection of such intervals and suppose that 

a sg a x < b 1 < a 2 < ■ ■ • < b n . i < a n < b n s? b. 

(This may require a renumbering of the indices, but it can always be 
arranged.) Now 

2 /((fly, H) = f 0 b , - a,) 

7=1 7=1 

<6-a = /((a, 6]). 
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Since n is arbitrary, we infer that 

oo 

(9.3) <((a,6]). 

3 = 1 

Conversely, let e > 0 be arbitrary, and let (ej) be a sequence of 
positive numbers with 22 £ j < £ /2. Now consider the intervals 

= ( a j - £j,bj + £j), 3 € TV. 

Prom (9.2) it follows that the open sets {Tj : j 6 TV} form a covering 
of the compact interval [a, 6], Hence, this interval is covered by a 
finite number of the intervals, say by /j, J 2 , . . . , I m . By renumbering 
and discarding some extra intervals we may assume that 

«!—£!< a, b<b m + £ m , 

a j ~ s j < bj-i + £j- 1 , j = 2 , . . . , m. 

It follows from these inequalities that 

m 

b — a <1 ( b m + £ m ) — (aj — £i) } ^ [(bj + £j) — ( Oj — £j)] 

J=l 

m oo 

< ]£(&* - a j) + e < - a i ) + 

3 = 1 3 = 1 

Since e > 0 is arbitrary, it follows that l((a,b}) ^ 

Combining this inequality with (9.3), we conclude that the length 
function l is countably additive on F. q.e.d. 

THE EXTENSION OF MEASURES 

Now that we have given a significant example of a measure defined 
on an algebra of sets, we return to the general situation. We shall 
show that if A is any algebra of subsets of a set X and if ^ is a measure 
defined on A, then there exists a a-algebra A* containing A and a 
measure /x* defined on A* such that n*(E) = for Em A. In 
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other words, the measure /x can be extended to a measure on a a-algebra 
A* of subsets of X which contains A. The procedure that we employ 
is the following: we shall use /x to obtain a function defined for all 
subsets of X , and then pick out a collection of sets on which a certain 
additivity property holds. 

9.4 Definition. If B is an arbitrary subset of X, we define 

(9.4) v*(B) = inf f /*(£)), 

;'=i 

where the infimum is extended over all sequences (E,) of sets in A 
such that 

(9.5) B s 0 Ej. 

y=i 

It should be remarked that the function /x* just defined is usually 
called the outer measure generated by /x. Although this terminology 
is unfortunate because /x* is not generally a measure, /x* does have a few 
properties reminiscent of a measure. 

9.5 Lemma. The function /x* o/ Definition 9.4 satisfies the following: 

(a) n*(9) = 0. 

(b) /x*(fl) 5= 0, for B c A. 

(c) If Ac B, then /x*(A) s? ,x*(5). 

(d) If Be A, then f*(B) = f. i(B). 

( e ) V (^n) is a sequence of subsets of X, then 

Vn=1 ’ n = 1 

proof. Statements (a), (b), and (c) are immediate consequences of 
the Definition 9.4. 

(d) Since {B, 0, 0, . . ,} is a countable collection of sets in A whose 
union contains B, it follows that 

t J *(B) ^ l J -(B) + 0 + 0 + ■ • • = fr(B) . 

Conversely, if ( E n ) is any sequence from A with B E n , then 
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B = {J (B n E n ). Since /la is a measure on A , then 

m < f ^(Bn E n ) ^ J !4E n ), 

n=l n=l 

from which it follows that ^(B) < n*(B) . 

To establish (e), let e > 0 be arbitrary and for each n choose a 
sequence (E nk ) of sets in A such that 

— U Enk and ^ H-(Enk) ^ P-*(B n ) + ^ * 

k ~ 1 k=l z 

Since {E nk : n , k e TV} is a countable collection from A whose union 
contains ij , it follows from the definition of /j.* that 

/*•( u, B n ) < 2 2 n(E nk ) < 2 n*(B n ) + e. 

Since e is arbitrary, the desired inequality is obtained. q.e.d. 

Property (e) of Lemma 9.5 is referred to by saying that //* is countably 
subadditive. 

Although /i* has the advantage that it is defined for arbitrary subsets 
of X, it has the defect that it is not necessarily countably (or even 
finitely) additive. We are willing to restrict /a* to a smaller a-algebra 
provided we can find one containing A and over which fi* has the 
property of countable additivity. There is a remarkable condition 
due to Caratheodory which provides the desired restriction of the 
domain of /a* . 

9.6 Definition. A subset E of X is said to be ^-measurable if 

(9.6) p *(A) = fi*(A n£) + /a *(A \ E) 

for all subsets A of X. The collection of all ju*-measurable sets is 
denoted by A*. 

Condition (9.6) indicates an additivity property on /a*. In loose 
terms, a set E is /A*-measurable in case it and its complement are 
sufficiently separated that they divide an arbitrary set A additively. 
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9.7 Caratheodory Extension Theorem. The collection A* of all 
H*- measurable sets is a o-algebra containing A . Moreover, if (E n ) is a 
disjoint sequence in A* , then 

(9.7) m*( 0 En) = 2 /**(£»)• 

proof. It is clear that 0 and X are ^-measurable, and that if 
E e A * , then its complement X \ E belongs to A*. 

Next we shall show that A* is closed under intersections. Indeed, 
suppose that E and F are /**-measurable. Then for any A s X and 
Ee A* , we have 

H*(A n F) = n*(A n F r\ E) + n*(fA nF)\E ) 

Since F e A * , then 

lx*(A) = M *(/tnF) + n*(A\F). 

Let B = A \(E n F), then it is readily seen that inf = (A F)\E 
and B \ F = A \F; since Fe A* it follows that 

/i*(A \ (E n F)) = p*((A nF)\E) + v*(A \ F). 

Combining these three relations, we obtain 

/j.*(A) = f**(A nEnF) + p *(A \ ( E n F)), 

which shows that E n F belongs to A*. Since A* is closed under 
intersection and complementation, it follows that A* is an algebra. 

Suppose that E, Fe A* and that E n F = 0. If we take A to be 
A n (E u F) in (9.6), we obtain 

H*(A n(£u F)) = fji*(A n E) + n*(A n F). 

For A = X, this relation implies that is additive on A.* . 

We shall now show that A* is a cr-algebra and that n* is countably 
additive on A*. Let (E k ) be a disjoint sequence in A* and let£ = {jE k . 
From the preceding paragraph, we know that F n = (Jfc=i E* belongs 
to A*, and that if A is any subset of X, then 

F*(A) = n*(A n F n ) + p*(A \F n )= 2 ^ n E k ) + /**(. A \ F n ). 

k= 1 
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Since F n £ E, then A \ E £ A\F n and letting n — > oo the above 
relations yields 

00 

2 r*(A n E k ) + f,*(A \ E) < p*(A). 

fc = l 

On the other hand, it follows from Lemma 9.5(e) that 
f**(A nE) ^ f F*( A n E k), 

fc=l 

P*(A) < fi*(A nE) + p*(. A \ E). 

On combining the last three inequalities we infer that 

V*(A) = f**(A nE) + fx*(A \E)= 2 n E k ) + n*(A \ E). 

k=l 

In particular, this shows that E = (j£L a E k is /immeasurable. On 
taking A = E, we obtain (9.7). 

It remains to prove that A £ A*. It was proved in Lemma 9.5(d) 
that if Ee A, then p*(E) = p(E) , but we need to show that E is 
^-measurable. Let A be an arbitrary subset of X; it follows from 
Lemma 9.5(e) that 

H*(A) ^ f**(A nE) + tx*(A \ E). 

To establish the opposite inequality, let e > 0 be arbitrary and let (F n ) 
be a sequence in A such that A £ (J F n and 

00 

2 M-fiO < V*(A) + e. 

n = 1 

Since A n E £ (J ( F n n E) and A \ E £ ( F n \E), it follows from 

Lemma 9.5(e) that 

f*{A nE)^ 2 F{Fn n E), h *{A \ E) < 2 F( F n \ E). 

n=l n=l 

Hence we have 

y*(A nE) + ,,*(A \E)Z 2 ^E) + f.(F n \ E)} 

n= 1 

= 2 < f*(A) + s. 

n = 1 
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Since t is arbitrary, the desired inequality is established and the set E 
belongs to A*. q.e.d. 

The Caratheodory Extension Theorem shows that a measure p on 
an algebra A can always be extended to a measure p* on a a-algebra 
A* containing A. The a-algebra A* obtained in this way is auto- 
matically complete in the sense that if Ee A* with p*(E) = 0, and if 
B £ E, then B e A* and p*(B) = 0. To prove this, let A be an 
arbitrary subset of X and employ Lemma 9.5(c) to observe that 

V*(A) = p*(E) + p*(A) > p*(A n B) + p*(A \ B); 
and, as before, the inequality 

P*(A) < p*(A nB) + p*(A \ B) 
follows from Lemma 9.5(e). Hence B is /^-measurable and 
0 < p*{ B ) s£ p*(E) < 0. 

We shall now show that in the case that p is a a-finite measure, it has 
a unique extension to a measure on A*. 

9.8 Hahn Extension Theorem. Suppose that fx is a o-finite measure 
on an algebra A . Then there exists a unique extension of /a to a measure 
on A* . 

proof. The fact that /x* gives a measure on A* was proved in 
Theorem 9.7 even without the a-finiteness assumption. To establish 
the uniqueness, let v be a measure on A* which agrees with /x on A. 

First suppose that /x and therefore both n* and v are finite measures. 
Let E be any set in A* and let (E n ) be a sequence in A such that 
E £ U En ■ Since v is a measure and agrees with p on A we have 

v(E) ^ v( 0 E„) sc J v{E n ) = J p{E n ) . 

Therefore v{E) < p*(E) for any Ee A*. Since p* and v are additive, 
P*(E) + p*(X \ E) = v(E) + v(X \ E) . Since the terms on the right- 
hand side are finite and not greater than the corresponding terms on 
the left side, we infer that p*(E) = v(E) for all Ee A*. This establishes 
the uniqueness when p is a finite measure. 
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Suppose that n is a-finite and let (F n ) be an increasing sequence of 
sets in A with /j.(F n ) < +co and X = (J F n . From the preceding 
paragraph, fx*(E n F n ) = V (E n F r ) for each EinA*. Therefore 

E*(E) = lim /**(£ n F n ) 

= lim v(E r\ F n ) = v(E), 

so that fj.* and v agree on A* . q.e.d. 


LEBESGUE MEASURE 

We now return to the considerations that prompted the foregoing 
extension procedure, namely, to the generation of a measure on the 
real line R. In Lemma 9.3 we saw that the set F of all finite unions of 
sets of the form 

(a,b], (— 00 , b), (a, + co), (-co, +oo), 

was an algebra of subsets of R and that the length function l gives a 
measure on this algebra F. If we apply the extension procedure to / 
and F, we generate a measure space (R, F* , /*). The o-algebra F* 
obtained in this construction is called the collection of Lebesgue 
measurable sets and the measure /* on F* is called Lebesgue measure.! 

Although we sometimes wish to work with ( R,F*,l *), it is often 
more convenient to deal with the smallest a-algebra containing F than 
with all of F* . It is readily seen that this smallest a-algebra is exactly 
the collection of Borel sets. The restriction of Lebesgue measure to 
the Borel sets is called either Borel or Lebesgue measure. Lest the 
reader feel that restricting to B weakens the theory by substantially 
lessening the collection of measurable sets and functions, we call 
attention to Exercise 9.K where it is seen that every Lebesgue measurable 
set is contained in a Borel measurable set with the same measure, and 
every Lebesgue measurable function is almost everywhere equal to a 
Borel measurable function. 

t It might be thought that every subset of the real line is Lebesgue measurable, 
but this is not the case. For the construction of sets which are not Lebesgue 
measurable, see pp.171 ff. 
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Sometimes it is more convenient to use a notion of the magnitude 
of an interval other than length. This can be treated as follows. Let 
g be a monotone increasing function on R to R so that x ^ y implies 
that g(x ) s? g(y) . In addition, we shall assume that g is continuous 
on the right at every point, so that 

g(c) = lim g(c + ft). 

ft -0 + 

Since g is monotone, it also follows that 

lim g(*), lim g(x) 

x-+ — co x-* + oo 

both exist, although they may be — oo or +oo. 

For such a function we define 

ft]) = g(b) - g(a), 

/a 9 ((-go, ft]) = g(b) - lim g(x), 

x-* - oo 

/+((«, +oo)) = lim g(x) - g(a), 

X-+ + oo 

/x 9 ((-oo,go)) = lim g(x) - lim g(x). 

x-+ + oo x~* - oo 

We further define g. g on the algebra F of finite disjoint unions of such 
sets to be the corresponding sums. If the reader will check the details 
of the proof of Lemma 9.3, he will see that it can be easily modified 
to show that g g gives a cr- finite measure on the algebra F . Therefore, 
this measure has a unique extension, which we also denote by n g to the 
algebra of all Borel subsets of R. This extension is often referred to 
as the Borel-Stieltjes measure generated by g . (Of course, by applying 
Theorem 9.7, g, g has an extension to a complete a-algebra which 
contains the Borel sets. This extension is called the Lebesgue-Stieltjes 
measure generated by g.) 


LINEAR FUNCTIONALS ON C 

We shall conclude this chapter by showing that there is an intimate 
correspondence between Borel-Stieltjes measures on a finite closed 
interval J — [a, ft] and bounded positive linear functionals on the 



106 The Elements of Integration 

Banach space C(J ) of all continuous functions on J to R with the norm 

(9.8) ll/ll = sup {|/(x)| : xeJ}. 

This result, due to F. Riesz, has been considerably extended in many 
directions. Indeed, it is taken as the point of departure for the develop- 
ment of a theory of integration by many authors who prefer to regard 
the integral as a linear functional on spaces of continuous functions. 
We choose to take a very concrete approach to this theorem and offer 
a proof which is closely parallel to the Riemann-Stieltjes integral 
version presented in Reference [1], pp. 290-294. 

9.9 Riesz Representation Theorem. If G is a bounded positive 
linear functional on C(J), then there exists a measure y defined on the 
Borel subsets of R such that 

(9.9) G<J) = J fdy 

for all f in C(J). Moreover, the norm ||G|| of G equals y{J). 

proof. If t is such that a < t < b and n is a sufficiently large 
natural number, let <p t _ n be the function in C(J) which equals 1 on [a, t ], 
which equals 0 on (f + 1 jn,b], and which is linear on (t, t + 1 /«]. If 
n ^ m and x eJ, then 0 < <Pt, m ( x ) < <Pt,n( x ) < 1 , so that the real 
sequence (G(<p tn )) is bounded and decreasing. If t e [a, b), we define 

g(t) = lim G(cp t J. 

n-» + oo 

Further, set g(t) = 0 for t < a; if t ^ b, we set g{t) = G(<p b ) where 
•pfx) = 1 for all x e J . It is readily seen that g is a monotone increasing 
function on R . 

We claim that g is continuous from the right. This is clear if t < a 
or t > b. Suppose that t e [a, b) and e > 0 and let 

n > sup {2, ||G||s _1 } 

be so large that 

g(t ) < G(<p tiU ) ^ g(t ) + s. 
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If >jj n is the function in C(J) which equals 1 on [a, t + «” 2 ], which 
equals 0 on (t + n' 1 — n~ 2 , b], and which is linear on 

(t + n~ 2 , t + n~ 1 — n~ 2 ], 

then an exercise in analytic geometry shows that ||</> n — 9=t.n|| < 1 /«. 
Therefore 

G(« < Gfe.n) + (J) !|G|| < g(t) + 2s, 

so that g(t) ^ g(t + n~ 2 ) < g(t) + 2s. 

According to the Hahn Extension Theorem there exists a measure y 
on the Borel subsets of R such that y ((<*,/!]) = gift) — g(a). In 
particular, this shows that y(E) = 0,if E nj = 0, that 

y([«> c]) = y((fl - 1, c]) = g(c), 

and that ||G|| = \G{<p b )\ = g(b) = 7 (J). 

It remains to show that equation (9.9) holds for /in C(J). If s > 0, 
since / is uniformly continuous on J, there is a S(s) > 0 such that if 
\x - y\ < 8(s) and x,yeJ, then \f(x) - f(y)\ < s. Now let 
a = t 0 < t x < ■ ■ ■ < t m - b be such that sup {t k - t k - x } < ^S(s) and 
choose n so large that 2/« < inf {t k - t k ^ x } and that for A; = 1 , . . . , m, 
then 

(9.10) g(t k ) G(<p tk >n ) sg g(t k ) + e(m||/||)- 1 . 

We now consider functions defined on J by 

m 

fl( x ) =fV l)<Ph.n( X ) + 2 /(hcX?!* ,n(*) - 9t lc . 1 , nW}> 

fc = 2 

m 

fz( x ) = f(h) Xu a ,t L i( x ) + 2 /(he) - 1 .«*](*) • 

k=2 

Note that/ e C(J) and that/ is a step function on J. It is clear that 
SU P (l/C*) — fix ) | : xe/} =g e and as an exercise (or see [1], p. 292) 
the reader can show that [| / - /|| sg e. Therefore we have 


I Gif) ~ G(/i)l < « ||G||. 
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In view of (9.10) we see that if 2 ^ k < m, then 

|G(9 %.n ~ ~ { gttk ) ~ g(tk- l)}| < e(»l l/l)" 1 . 

Apply G to /i and integrate f 2 with respect to y . The inequality just 
obtained yields 

|G(/i) - h dy\^*. 


But since f 2 lies within s of /, we also have 


| j j h d y - [f d y < e y( J )• 

Combining the inequalities, we arrive at the inequality 
I <Kf) ~ l fdy\ < s (2 1| G || + 1), 


and since s is arbitrary, we deduce (9.9). q . e . d . 

If the reader will check the proof of Lemma 8.13, he will see that an 
arbitrary bounded linear functional G on C(J ) can be written as the 
difference G + — G~ of two positive bounded linear functionals. 
Making use of this observation, one can extend the Riesz Representa- 
tion Theorem given above to represent a bounded linear functional on 
C(J ) by means of integration with respect to a charge defined on the 
Borel subsets of J. 


EXERCISES 

9.A. Establish that the family F of all finite unions of sets of the 
form (9.1) is an algebra of sets in R. 

9.B. Show that the family G of all finite unions of sets of the form 

(a,b), (—co, b), (a, +oo), (-co, + 00 ) 

is not an algebra of sets in R. However, the a-algebra generated by G 
is the family of Borel sets. 
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9.C. Show that if the set (a, +oo) is the union of a disjoint sequence 
of sets ( a n , b v ], then 

f l{(a n ,b n ]) = +oo. 

n= 1 

9.D. Let X be the set of all rational numbers r satisfying 0 < r < 1 
and let A be the family of all finite unions of “half-open intervals” of 
the form {r e X : a < r ^ b) , where 0 ^ a ^ b ^ 1 and o, b € Q. 
Show that A is an algebra of subsets of X. Moreover, every non- 
empty set in A is infinite. However, the <r-algebra generated by A 
consists of all subsets of X. 

9.E. If £ is a countable subset of R, then it has Lebesgue measure 
zero. 

9.F. Let /» = («,»+ 1], for « = 0, + 1 , +2, ... . If a subset E 
is contained in the union of a finite number of the {/„}, then l*(E) < 
+oo. However, construct a Lebesgue measurable set E with /*(£) < 
+oo such that l*(E n /„) > 0 for all n. Show that a subset E of R is 
Lebesgue measurable if and only if E n I n is Lebesgue measurable 
for each n . 

9.G. If + is a Lebesgue measurable subset of R and e > 0, show that 
there exists an open set 2 A such that 

1*{A) < l*(G e ) ^ l*(A) + e. 

9.H. If £ is a Lebesgue measurable subset of R, if s > 0, and if 
B £ /„ = (n, n + 1], then there exists a compact set K t 2 B such that 

l*(K s ) < l*(B) l*(K z ) + «. 

{Hint: Apply the Exercise 9.G to A - I n \B.) 

9.1. If A is an arbitrary Lebesgue measurable set in R, apply the 
preceding exercises to show that 

1*{A) = inf {/*((/) : A S G, G open}, 
l*(A) = sup {/*(£) : K c A, K compact}. 

9.J. Let A = /* denote Lebesgue measure on R, and let + be a 
Lebesgue measurable set with A(+) < +oo. If e > 0, there exists an 



110 The Elements of Integration 


open set which is the union of a finite number of open intervals such 
that 

Wxa - Xg\\ = A(AAG) ^ e. 

Moreover, if e > 0 there exists a continuous function / such that 

II Xa - /Hi = J| Xa -f\dX< e. 

9.K. Let A be a Lebesgue measurable subset of R. Show that 
there exists a Borel measurable subset B of R such that A s B and 
such that l*(B \ A) — 0. (Hint: Consider the case where l*(A) < +oo 
first.) Show that every Lebesgue measurable set is the union of a 
Borel measurable set (with the same measure) and a set of Lebesgue 
measure zero. In the terminology of Exercise 3.L, this asserts that 
the Lebesgue algebra is the completion of the Borel algebra. As a 
consequence of Exercise 3.N, we infer that every Lebesgue measurable 
function is almost everywhere equal to a Borel measurable function. 

9.L. If g belongs to L(R, B, A) and e > 0, then there exists a con- 
tinuous function / such that 

Ik -/Hi = j\g ~ f\ d\ < s. 

9.M. If B is the Borel algebra and A is Lebesgue measure on B, 
show (i) A (G) > 0 for every open G^0, (ii) A (K) < +oo for every 
compact set K, and (iii) A(x + E) = A (E) for all E e B. (Here 
x + E = {x+y\ ye E}.) 

9.N. Let A be a set, A an algebra of subsets of X, and p a measure 
on A. If B is an arbitrary subset of X, let p'(B) be defined to be 

p'(B) = inf {p(A) : B s A e A}. 

Show that fj,'(E) = ^(E) for all Ee A and that fi*(5) ^ p-'(B). More- 
over, fi* = p 1 in case X is the countable union of sets with finite 
//.-measure. Is p' countably subadditive in the sense of 9.5(e)? 

9.0. Let X be an uncountable set and let A be the collection of sets 
E which are either finite or have finite complement. In the former 
case let p(E) = 0; in the latter, let p(E) — +oo. Show that p is a 
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measure on A. Calculate the outer measure /a* corresponding to 
Definition 9.4. Calculate the set function /a' defined in Exercise 9.N. 
Are they the same ? 

9.P. Let A be a set and let a be defined for arbitrary subsets of X to 
R and satisfy 

0 < a (E) < ol(E Uf)< «(£’) + a (F ) , 

when E and F are subsets of X. Let S be the collection of all subsets 
E of X such that 

a(A) = a(A n E) + a(A \ E) 

for all A £ X. If S # 0, it is an algebra and a is additive on S. 

9.Q. It may happen that the collection S in Exercise 9.P is empty. 
For example, let a(E) = 1 for all E £ X. 

9.R. Let X and A be as in Exercise 9.D, and let A x be the <r-algebra 
generated by A. Let /aj be the counting measure on A l and let 
fx 2 = 2^i- Show that g 1 = fi 2 on A but not on A x . (Hence the 
<T-finiteness hypothesis in Theorem 9.8 cannot be dropped.) 

9.S. Let g be a monotone increasing and right continuous function 
on R to R . If fig is defined as at the end of this section, show that fi g is a 
measure on the algebra F. 

9.T. Consider the following functions defined for x £ R by: 

(a) £i(x) = 2x, (b) g 2 (x) = Arc tan x, 

(c) S 3 (x) = 0, x < 0, (d) gi(x) = 0, x < 0, 

= 1, x > 0, = x, x ^ 0. 

Describe the Borel-Stieltjes measures determined by these functions. 
Which of these measures are absolutely continuous with respect to 
Borel measure? What are their Radon-Nikodym derivatives? 
Which of these measures are singular with respect to Borel measure? 
Which of these measures are finite? With respect to which of these 
measures is Borel measure absolutely continuous ? 

9.U. Let /a be a finite measure on the Borel sets B of R and let 

g(x) = /a((— oo, x]) for x e R. Show that g is monotone increasing 

and right continuous, and that 

fi((d, £]) = g(b) - g{a) 
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when —co < a ^ b < +oo. Show that n(R) = lim g(x). 

X-* co 

9.V. Let/be Riemann integrable on [a, b ] to R. Then there exists 
a monotone increasing sequence ( <p n ) and a monotone decreasing 
sequence (</>„) of step functions such that q> n (x) < f(x) < >fi n (x) for 
x e [a, b] and 

lim j<p n dX = lim j>p n dX . 

(Here A denotes Lebesgue measure.) Show that / = lim <p n = lim <p„ 
almost everywhere, that / is Lebesgue measurable, and that 
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Product Measures 


Let X and Y be two sets ; then the Cartesian product Z = X x Y is 
the set of all ordered pairs ( x , j) with x e X and y e Y. We shall first 
show that the Cartesian product of two measurable spaces {X, X) 
and ( Y, Y) can be made into a measurable space in a natural fashion. 
Next we shall show that if measures are given on each of the factor 
spaces, we can define a measure on the product space. Finally, we 
shall relate, integration with respect to the product measure and iterated 
integration with respect to the measures in the factor spaces. The 
model to be kept in mind throughout this discussion is the plane, which 
we regard as the product R x R. 

10. 1 Definition. If (X, X) and ( Y, F) are measurable spaces, then a 
set of the form A x B with A e X and Be Y is called a measurable 
rectangle, or simply a rectangle, in Z = X x Y. We shall denote the 
collection of all finite unions of rectangles by Z 0 . 

It is an exercise to show that every set in Z 0 can be expressed as a 
finite disjoint union of rectangles in Z (see Exercise 10.D). 

10.2 Lemma. The collection Z 0 is an algebra of subsets of Z. 

proof. It is clear that the union of a finite number of sets in Z 0 
also belongs to Z 0 . Similarly, it follows from the first part of Exercise 
10.E that the complement of a rectangle in Z belongs to Z 0 . Apply 
De Morgan’s laws to see that the complement of any set in Z 0 belongs 
t° Z 0 . q.e.d. 

113 
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10.3 Definition. If (X, X ) and ( Y, Y) are measurable spaces, then 
Z = X x Y denotes the cr-algebra of subsets of Z = X x Y generated 
by rectangles A x B with A e X and B e Y. We shall refer to a set in 
Z as a Z-measurable set, or as a measurable subset of Z. 

If (X, X, (f) and ( Y, Y, v) are measure spaces, it is natural to attempt 
to define a measure tt on the subsets of Z = X x Y which is the 
“product” of ft, and v in the sense that 

tt(A x B) = fi(A) v(B), A e X, B 6 Y. 

(Recall the convention that 0(±oo) = 0.) We shall now show that 
this can always be done. 

10.4 Product Measure Theorem. If ( X , X, f) and (Y, Y, v) are 
measure spaces, then there exists a measure n defined on Z = X x Y 
such that 

(10.1) n(A x B) = /c(A) v(B) 

for all AeX and BeY. If these measure spaces are o-finite, then there 
is a unique measure tr with property (10.1). 

proof. Suppose that the rectangle A x B is the disjoint union of a 
sequence (A } x If) of rectangles; thus 

co 

x A * B {x,y) = xa(x) x B (y) = 2 XAi(x)xB,(y) 

3 = 1 

for all xe X, ye Y. Hold x fixed, integrate with respect to v, and 
apply the Monotone Convergence Theorem to obtain 

Xa(x) v{B) = 2 XA,(x)v(Bfi. 

3 = 1 

A further application of the Monotone Convergence Theorem yields 
p(A) v(B) = 2 

i = l 

Now let Ee Z 0 ; without loss of generality we may assume that 
E= U (A, x Bj), 

7 = 1 
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where the sets A } x B, are mutually disjoint rectangles. If we define 
ME) by 

ME) = ^ rfAtMB,), 

7 = 1 

the argument in the previous paragraph implies that -n- 0 is well-defined 
and countably additive on Z 0 . By Theorem 9.7, there is an extension 
of 77 o to a measure n on the a-algebra Z generated by Z 0 . Since v is 
an extension of n 0> it is clear that (10.1) holds. 

If (X, X, /x) and ( Y, Y, v) are a-finite, then 7r 0 is a a-finite measure 
on the algebra Z 0 and the uniqueness of a measure satisfying (10.1) 
follows from the uniqueness assertion of the Hahn Extension Theorem 
9.8. Q.E.D. 

Theorem 10.4 establishes the existence of a measure n on the a-algebra 
Z generated by the rectangles {A x B : A e X, B e F} and such that 
(10.1) holds. Any such measure will be called a product of fi and v. 
If /x and v are both a-finite, then they have a unique product. In the 
general case the extension procedure discussed in the previous chapter 
leads to a uniquely determined product measure. However, it will be 
seen in Exercise 10.S that it is possible for two distinct measures on Z 
to satisfy (10.1) if y. and v are not a-finite. 

In order to relate integration with respect to a product measure and 
iterated integration, the notion of a section is useful. 

10.5 Definition. If E is a subset of Z = X x Y and x e X, then 
the .v-section of E is the set 

E x = {yeY : ( x,y)eE } 

Similarly, if y e Y, then the j-section of E is the set 
E y — {xe X \ (x,y)eE}. 

If/ is a function defined on Z to R, and x e X, then the x-section of / 
is the function f x defined on Y by 


fx(y)=f(x,y), yeY. 
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Similarly, if ye 7, then the y-section of /is the function/ 3 ' defined 
on X by 

p(x) = /(x,y), xeX. 

10.6 Lemma, (a) If E is a measurable subset ofZ, then every section 
of E is measurable. 

(b) If f is a measurable function on Z to R, then every section of f is 
measurable. 

proof, (a) If £ = A x B and x e X, then the x-section of E is B 
if x e A, and is 0 if x $ A. Therefore, the rectangles are contained in 
the collection E of sets in Z having the property that each x-section is 
measurable. Since it is easily seen that E is a cr-algebra (see Exercise 
10.1), it follows that E = Z. 

(b) Let xe X and ae R, then 

{y e y ■ fxiy) > «} = {y e 7 : f(x, y) > a} 

= {(*, y) 6 X X 7 : fix, y) > a} x . 

If / is Z-measurable, then f x is 7-measurable. Similarly, f y is X- 
measurable. q.e.d. 

We interpolate an important result, which is often useful in measure 
and probability theory, and which will be used below. We recall (see 
Exercise 2. V) that a monotone class is a nonempty collection M of sets 
which contains the union of each increasing sequence in M and the 
intersection of each decreasing sequence in M. It is easy (see Exercise 
2.W) to show that if A is a nonempty collection of subsets of a set S, 
then the <x-algebra S generated by A contains the monotone class M 
generated by A. We now show that if A is an algebra, then S = M. 

10.7 Monotone Class Lemma. If A is an algebra of sets, then the 
u-algebra S generated by A coincides with the monotone class M generated 
by A . 

proof. We have remarked that M s S. To obtain the opposite 
inclusion it suffices to prove that M is an algebra. 

If E e M, define M(E) to be the collection of Fe M such that 
E\E, E n F, F\ E all belong to M. Evidently 0, EeM{E) and it is 
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readily seen that M(E ) is a monotone class. Moreover, F e M(E) if 
and only if E e M(F) . 

If E belongs to the algebra A, then it is clear that A £ M(E). 
But since M is the smallest monotone class containing A , we must have 
M(E) = M for E in A . Therefore, if E e A and Fe M, then F e M(E) . 
We infer that if E e A and Fe M, then E e M(F) so that A £ M(F) for 
any Fe M. Using the minimality of M once more we conclude that 
M{F) = M for any Fe M. Thus M is closed under intersections and 
relative complements. But since X e M it is plain that M is an algebra ; 
since it is a monotone class, it is indeed a a-algebra. q.e.d. 

It follows from the Monotone Class Lemma that if a monotone class 
contains an algebra A , then it contains the a-algebra generated by A . 

10.8 Lemma. Let ( X , X, p) and ( Y , Y, v) be o-finite measure spaces. 
If Ee Z = X x Y, then the functions defined by 

(10.2) f(x) = v(E x ), g(y) = p(E y ) 
are measurable, and 

(10.3) ^fdp. = n(E) = j y g dr. 

proof. First we shall suppose that the measure spaces are finite 
and let M be the collection of all Ee Z for which the above assertion 
is true. We shall show that M = Z by demonstrating that M is a 
monotone class containing the algebra Z 0 . In fact, if E = A x B 
with A e X and Be Y, then 

/(*) = Xa(x) v(B), giy) = xs(j) p(A), 

ix = ^ ^ = j y g dv ' 

Since an arbitrary element of Z 0 can be written as a finite disjoint 
union of rectangles, it follows that Z 0 £ M. 

We now show that M is a monotone class. Indeed, let (E n ) be a 
monotone increasing sequence in M with union E. Therefore 

fn(x) = v{(E n ) x ) , gfy) = ix((E n y) 
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It is clear that the monotone increasing sequences (/„) and (g n ) converge 
to the functions / and g defined by 

f(x) = v(E x ), g{y) = p(E y ). 

If we apply the fact that n is a measure and the Monotone Convergence 
Theorem, we obtain 

f dp- = n{E) = j^g dv, 

so that E e M. Since n is finite measure, it can be proved in the same 
way that if (F„) is a monotone decreasing sequence in M, then F = P) F n 
belongs to M. Therefore M is a monotone class, and it follows from 
the Monotone Class Lemma that M = Z. 

If the measure spaces are <r-finite, let Z be the increasing union of a 
sequence of rectangles (Z„) with ir(Z„) < +oo and apply the previous 
argument and the Monotone Convergence Theorem to the sequence 
(£nZ n ). q.e.d. 

10.9 Tonelli’s Theorem. Let (X, X, p) and (Y, Y, v) be o-finite 
measure spaces and let F be a nonnegative measurable function on 
Z = X x Y to R. Then the functions defined on X and Y by 

(10.4) fix) = £ F x dv, giy) = j F y dp , 
are measurable and 

(10.5) f /4*= f Fd*= f gdv. 

Jx Jz Jy 

In other symbols, 

(10.6) J (J Fdv j dp = J F dir = J (J Fdpj dv. 

proof. If F is the characteristic function of a set in Z, the assertion 
follows from the Lemma 10.8. By linearity, the present theorem holds 
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for a measurable simple function. If F is an arbitrary nonnegative 
measurable function on Z to R, Lemma 2.11 implies that there is a 
sequence (® n ) of nonnegative measurable simple functions which 
converges in a monotone increasing fashion on Z to F. If 9 \ and <p n are 
defined by 

(10.7) <Pn(x) = J y (<!>,), dv, Uy) = \ x m y dir , 

then <p n and ifi n are measurable and monotone in n. By the Monotone 
Convergence Theorem, (<p„) converges on X to/ and (f n ) converges on 
Ytog. Another application of the Monotone Convergence Theorem 
implies that 



The same theorem also shows that 

J F drc — lim J <b n dir, 

from which (10.5) follows. q.e.d. 

It will be seen in the exercises that Tonelli’s Theorem may fail if we 
drop the hypothesis that F is nonnegative, or if we drop the hypothesis 
that the measures ft, v are a-finite. 

Tonelli’s Theorem deals with a nonnegative function on Z and 
affirms the equality of the integral over Z and the two iterated integrals 
whether these integrals are finite or equal + 00 . The final result 
considers the case where the function is allowed to take both positive 
and negative values, but is assumed to be integrable. 

10.10 Fubini’s Theorem. Let (X,X,p) and (Y, Y, v) be o-finite 
spaces and let the measure n on Z = X x Y be the product of /j- and v . If 
the function F on Z = X x Y to R is integrable with respect to 7 r, then 
the extended real-valued functions defined almost everywhere by 

/(*) = F x dv, g(y) = J F y dp 


( 10 . 8 ) 
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(10.9) fdix = Fdn= gdv. 

Jx Jz Jy 

In other symbols , 

(10.10) I [I F dv j d/j. = j F dir = J J^J* _Fd|uJ dv . 

Proof. Since F is integrable with respect to n, its positive and 
negative parts F + and F~ are integrable. Apply Tonelli’s Theorem 
to F + and F~ to deduce that the corresponding / + and f~ have finite 
integrals with respect to fi. Hence f + and f~ are finite-valued 
/-(-almost everywhere, so their difference / is defined yu-almost every- 
where and the first part of (10.9) is clear. The second part is similar. 

Q.E.D. 

Since we have chosen in Chapter 5 to restrict the use of the word 
“integrable” to real- valued functions, we cannot conclude that the 
functions /, g defined in (10.8) are integrable. However, they are 
almost everywhere equal to integrable functions. 

It will be seen in an exercise that Fubini’s Theorem may fail if the 
hypothesis that F is integrable is dropped. 


EXERCISES 

10.A. Let A £ X and B £ Y. If A or B is empty, then A x B = 0. 
Conversely, if A x B = 0, then either A = 0 or B — 0. 

10.B. Let Aj £ X and B, £ Y, j = 1 , 2. If A x x B x = A 2 x B 2 
^ 0, then Ai = A% and B\ — B 2 - 

10.C. Let Aj £ X and B t £ Y,j= 1,2. Then 

(Ax x Bx) u (A 2 x B 2 ) = [(Ax \ A 2 ) x Bf\ 

u [(A i o A 2) x (Bx u BA] ^*[(^2)^1) x F2] , 


and the sets on the right side are mutually disjoint. 
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10.D. Let ( X , X) and (Y, Y) be measurable spaces. If A } e X and 
B, e Y for j = 1 , ,m, then the set 

U (A, x B ,) 

3 = 1 

can be written as the disjoint union of a finite number of rectangles in Z. 

10.E. Let A, c X and B } c y, j = 1,2. Then 

(Ai x By) \ (A 2 X B 2 ) = [(A, n .4 2 ) x \ fi 2 )] 

U [(Ay \ A 2 ) X By] 

(Ay x By) Pi (A 2 x Bj) = Pi /4 2 ) x Pi Bj). 

10.F. If (R, B) denotes the measurable space consisting of real 
numbers together with the Borel sets, show that every open subset of 
R x R belongs to B x B. In fact, this cr-algebra is the cr-algebra 
generated by the open subsets of R x R. (In other words, B x B is 
the Borel algebra of if x R .) 

10.G. Let / and g be real- valued functions on Zand Y, respectively; 
suppose that / is Z-measurable and that g is F-measurable. If h is 
defined for (r,j) in lx Y by h(x,y) = f(x)g(y), show that h is 
X x K-measurable. 

10.H. If Eis a subset of R, let y(E) = {(x, y) e R x R : x - y e E}. 
If Ee B, show that y(E) e B x B. Use this to prove that if / is a 
Borel measurable function on R to R, then the function F defined by 
•F(*> y) = f( x ~ y) is measurable with respect to B x B. 

10.1. Let E and F be subsets of Z = X x Y, and let xe X. Show 
that (E \ F) x = E X \F X . If (E a ) are subsets of Z, then 

(U E a ) x = U {E a ) x . 

10.J. Let (X, X, n) be the measure space on the natural numbers 
X = N with the counting measure defined on all subsets of X = N. 
Let (F, Y,v) be an arbitrary measure space. Show that a set E in 
Z = X x Y belongs to Z = X x Y if and only if each section E n of E 
belongs to F. In this case there is a unique product measure it, and 

n(E) = f v (E n ), Ee Z. 

Tt. = 1 
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A function / on Z = X x Y to R is measurable if and only if each 
section/, is Y-measurable. Moreover, /is integrable with respect to n 
if and only if the series 

2 f \fn\ dv 

n=i Jy 

is convergent, in which case 


J f d7T = 2 [J /» dv \ = f [ 2 dv - 

Jz n = i LJy Jy L n "Ti 


10.K. Let X and Y be the unit interval [0,1] and let X and Y be the 
Borel subsets of [0, 1]. Let ft be Lebesgue measure on X and let v be 
the counting measure on Y. If D = {(x, y) : x = y), show that D is a 
measurable subset of Z = X x Y, but that 


j v(D x ) dfj.(x) # j dv(y). 


Hence Lemma 10.8 may fail unless both of the factors are required to 
be a-finite. 

10.L. If F is the characteristic function of the set D in the Exercise 
10.K, show that Tonelli’s Theorem may fail unless both of the factors 
are required to be o-finite. 

10.M. Show that the example considered in Exercise 10.J demon- 
strates that Tonelli’s Theorem holds for arbitrary ( Y, Y, v) when 
(X, X, ft) is the set N of natural numbers with the counting measure on 
arbitrary subsets of N. 

10.N. If a mn ^ 0form,«eiV, then 

CO CO CO GO 

2 2 ^ mn = 2 2 ^ mn ( ^ • 

m = 1 n = l n= 1 m= 1 


10.0. Let a mn be defined for m, n e N by requiring that a nv = + 1 , 
a n,n+i = ~ 1 , and a mn = Oifm^«orm#rt-f 1. Show that 


2 2 «- 


2 2 


1 s 


so the hypothesis of integrability in Fubini’s Theorem cannot be 
dropped. 
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10.P. Let /be integrable on (X, X, g) , let g be integrable on ( Y, Y, v) , 
and define h on Z by h(x, y) = f{x) g(y) . If n is a product of g and v, 
show that h is 7r-integrable and 

/.**■ If. HLC *'*■]• 

10.Q. Suppose that ( X , X, g) and ( Y, Y, v ) are a-finite, and let 
E, F belong to X x Y. If v(E x ) — v(F x ) for all xe X, then tt(E) = tt(F) . 

10.R. Let / and g be Lebesgue integrable functions on (R, B) to R. 
From Exercise 10.H it follows that the function mapping ( x,y ) into 
f(x - y) g(y) is measurable with respect to B x B. If A denotes 
Lebesgue measure on B, use Tonelli’s Theorem and the fact that 

f I fix ~ y) I dX(x) = f \f{x)\ d\(x) 

Jr Jr 

to show that the function h defined for x e R by 
Kx) = f f(x - y) g(y) dk(y) 

J R 

is finite almost everywhere. Moreover, 

/H < [J I/I <^a] [J |g| Ja] . 

The function h defined above is called the convolution of / and g and is 
usually denoted by/* g. 

10. S. Let X = R, X be the <7-algebra of all subsets of R and let y. be 
defined by fj.(A) = 0 if A is countable, and /x(A) = +oo if A is un- 
countable. We shall construct distinct products of g with itself. 

(a) If E g Z = X x X, define n (E) = 0 in case E can be written as 
the union E = G U H of two sets in Z such that the x-projection of G 
is countable and the ^-projection of H is countable. Otherwise, define 
7 r (E) = +oo. It is evident that tt is a measure on Z. If ME) = 0, 
then E is contained in the union of a countable set of lines in the plane. 
If A, B e X, show that MA x B) = MA) MB) ■ Hence tt is a product 
of y, with itself. 
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(b) If E e Z, define p(E) = 0 in case E can be written as the union 
E=GvH'JKo{ three sets in Z such that the x-projection of G is 
countable, the ^-projection of H is countable, and the projection of K 
on the line with equation y = x is countable. Otherwise, define 
p(E) = +oo. Now p is a measure on Z, and if p(E) = 0, then E is 
contained in the union of a countable set of lines. Show that 
p(A x B) = p(A) fi(B) for all A, BeX; hence p is a product of p. with 
itself. 

(c) Let E = {(x, >>) : x + y = 0}; show that EeZ. However, 
p(E) = 0, whereas n(E) = +co. 
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CHAPTER 11 


Volumes of Cells and Intervals 


A cell in R with endpoints a, b (where a < b) is a set having one 
of the following four forms: 


(i) 

(a, b) 

(ii) 

[a, 6] 

(iii) 

(a, 6] 

(iv) 

[a, b) 


= {x e R : a < x <b}, 

= {x & R : a < x < b}, 

= {x € R : a < x < b}, 

= {x € R : a < x < b}. 


A cell with the first form is called an open cell with endpoints a, b. 
A cell having the second form is called a closed cell with endpoints 
a, b. Cells having the third or fourth forms are called half-open or 
half-closed cells. We note that if a = b, then only the closed cell is 
nonvoid. 

An open cell in R is an open subset of R, and a closed cell is 
a closed subset of R; both of these types of cells are often conve- 
nient to work with because they fit into the topological scheme of 
R. A nonvoid half-open cell is neither open nor closed, so it does not 
have very much topological interest; however, half-open cells have the 
advantage that the intersection, the union, or the difference of two 
half-open cells (of the same kind) are either half-open cells, or the 
union of two disjoint half-open cells. Moreover, the complement of 
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a half-open cell (a, 6 ] is the union of the sets (-00, a] and (b, +00). 
It can also be written as the union of a countable collection of pair- 
wise disjoint half-open cells of the form («*,&], where i e N and 
oci, Pi € R. 

Cells are also often called intervals; however, the sets 


(V) 

(—00, a) := 

= {x 

e 

R 

: x < 

0}, 

(vi) 

(—00, a] := 

= {x 

€ 

R 

: x < 

a}, 

(vii) 

[ 6 , +00) := 

= {x 

e 

R 

:b< 

a:}, 

(viii) 

( 6 , +00) := 

= {x 

€ 

R 

:b< 

x}, 

(ix) 

(-00, +00) := 

= R 

= 

{x 

eR 

: —00 


are also called intervals but they are not called cells. The intervals 
(v) and (vi) have the right endpoint a, and the intervals (vii) and 
(viii) have the left endpoint b. The intervals (v) and (viii) are open 
subsets of R; the intervals (vi) and (vii) are closed subsets of R. The 
final interval (ix) is both open and closed in R. 

The length of a cell in R with endpoints a < b is defined to be 
equal to b — a. Thus all four of the cells 


(a,b), (a, b], [a, b), ( a,b ] 

have the same length. We will frequently write 

l([a, b )) := b — a. 

If I is any cell with endpoints a < b, then the interior 1 ° of I is the 
open cell (a, b), and the closure I~ of I is the closed cell [a, b]. Thus 
if I is any cell with endpoints a < b, then the length of I is equal to 
the lengths of its interior 1 ° and of its closure I~; in symbols 

1(1) = 1(1°) = l(I-). 

The length of any one of the intervals (v) - (ix) is taken to be +00; 
we write, for example, 


l((— 00, a]) = +00. 
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CELLS, CUBES, AND INTERVALS 

If p € R p , p > 1, then a cell I in R p is the Cartesian product of 
p cells in R] thus 

I = h x ... x I p . 

An open cell in R p is the Cartesian product of p open cells in R, 
and a closed cell in R p is the Cartesian product of p closed cells in 
R. An open cell in R p is an open set in R p , and a closed cell in R p 
is a closed subset of R p . We say that a cell I in R p is half-open if 
it is the Cartesian product of cells I\,...,I p in R all of which have 
the (“southwest”) form [a, 6), or all of which have the (“northeast”) 
form (a,b\. It is an exercise to show that if I and J are half-open 
cells in R p having the same form, then InJ,Il)J, and I- J are the 
unions of a finite number of pairwise disjoint half-open cells in R p . 
Similarly, the complement I c := R p — I is the union of a countable 
collection of pairwise disjoint half-open cells in R p . 

A cube in R p is a cell all of whose sides have equal length. 
Cubes may be open, closed, or half-open. An interval in RP is the 
Cartesian product of p intervals in R. Thus a cell in RP is also an 
interval in R p , but not conversely. 

R I = Ii x ... x I p is & cell in R p , then the p-dimensional 
volume 1(1) of / is defined to be the product of the lengths of the 
sides of Ji, . . . , Ip. Thus, if the endpoints of Ij are aj < bj for j = 
1 , ... ,p, then the volume of I is given by 

1(1) := (6j - ai) . . . (b p - a p ), 

As in the case of cells in R, if / = x . . . x I p is a cell in R p and 
the endpoints of Ij are aj < bj, then the interior 1° of I in RP is 
the open cell 

1° := (<*!, &i) x ... x (ap, b p ) 
and the closure I~ of I in R p is the closed cell 

I~ := [oi,&i] x ... x [a p ,b p ]. 
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Hence the volume 1(1) of a cell I in R p is equal to the volume 1(1°) 
of its interior, and also to the volume l(I~) of its closure. 

We also note that the empty set 0 can be considered to be an 
open cell with equal endpoints. Consequently, the above definition of 
the volume yields 1(0) = 0. 

Let I = Ii x . . . x I p be an interval in R p ; then we define the 
^dimensional volume 1(1) of I to be the product 


1(1) :=l(h)...l(I p ) 


of the lengths of the p sides. This product is to be interpreted as 
equaling 0 if at least one of the side lengths l(I\), l(I p ) is equal 
to 0 (even though some of the lengths may equal +oo). In addition, 
this product equals +oo if all of the side lengths 1(h), ■■■, l(I P ) are 
different from 0 and at least one of them equals +oo. 

TRANSLATION INVARIANCE 

One thing worth noticing about the volume of cells in R p is 
that it is independent of the “location” of the cells in the space, and 
depends only on the lengths of the sides. We express this idea by 
saying that the volume function is “translation invariant” . To clarify 
this notion, we introduce the following definition. 

11.1 Definition. If A is a subset of R p and x is any vector 
in R p , then the translation of d by x is defined to be the set 

x (B A := {x + a : a £ A}. 

It is easy to see that the translation of a cell I in R p is also a 
cell; moreover, it is an exercise to show that l(x © I) = 1(1) for all 
vectors x and cells / in the space R p . 



CHAPTER 12 
The Outer Measure 


We wish to extend the notion of the volume of a set in R p to sets 
that are more general than cells or intervals. Of course, if a set E in 
R p can be obtained as a disjoint union of a finite number of cells, it 
is natural to define the volume of E to be the sum of the volumes of 
the corresponding cells. However, such a set E may be decomposed 
in many ways as the disjoint union of cells and it is not immediately 
clear that different decompositions of E would always lead to the 
same value for the volume of E. In addition, many — in fact, most 
sets cannot be obtained as a finite union of cells; the circular disk 
D := {(a;, y) € R 2 : x 2 + y 2 < 1} is an important example of such 
a set. It is possible to show that D can be obtained as the disjoint 
union of a countable family of cells in R 2 ; in fact, this can be done 
in infinitely many ways. To define the 2-dimensional volume (that 
is, the “area” ) of D, we would surely need to use an infinite series 
for each decomposition, and to show that different decompositions of 
D give the same 2-dimensional volume for D. Even for as simple a 
figure as the disk D, this would not be an easy task. 

Our approach in generalizing the notion of volume to an arbitrary 
subset of R p will be by a process of “overestimation”. Basically, we 


131 



132 The Elements of Lebesgue Measure 


will enclose the set in the union of a countable collection of cells, 
and then minimize the sum of the volumes of the cells by taking the 
infimum over all such countable collections. (This process is rather 
similar to the construction of the upper integral that is often done in 
elementary calculus courses.) This “generalized volume” is useful for 
many purposes and has the advantage that it is defined for all subsets 
of R p . Unfortunately, it will be shown later that this generalized 
volume is not sufficiently well-behaved on the collection of all subsets 
of R p . However, when this generalized volume is restricted to a 
suitable collection (to be denoted by £) of subsets of R p , it will be 
seen to have some very satisfactory properties. In Chapter 4, this 
collection £ will be shown to contain all of the subsets of R p that 
arise in ordinary work. In particular, £ contains all open subsets 
and all closed subsets of R p , and it contains all countable unions, all 
countable intersections, and all complements of sets that belong to 
the collection £. 

12.1 DEFINITION. If E C R p , we define the outer measure 
m*(E) of E to be 

{ +oo 

k= 1 

where the infimum is extended over all sequences (I k ) of cells in R p 
that cover E in the sense that 

+oo 

(12.2) EC\Jl k . 

k= 1 



(1) We note that, since the entire set R p is contained in the 
union of a sequence (I k ) of cells in R p , the collection of all sequences 
that satisfy (12.2) is not empty. Therefore, the infimum appearing in 
(12.1) is well-defined and it is evident that m*(E) > 0. We also note 
that it is certainly possible to have m*{E) = +oo; for example, this 
is the case when we take E = R p . 
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(2) The terms l(I k ) certainly satisfy 0 < 1(h) < +oo for all k. 
Hence, the series 

+oo 

(12.3) £*(/*) 

k= 1 

is either (i) absolutely convergent, in which case the value of its sum 
does not depend on the order of summation, or (ii) the series (12.3) 
is divergent (= convergent to +oo), in which case we assign the value 
+oo as its sum. 

(3) Since the convergence or the divergence of (12.3) does not 
depend on the order of the summands, it is equally appropriate to 
think of (h) as being a countable family of cells rather than a sequence 
of cells. 

Before we proceed, we wish to make some further observations 
about Definition 12.1 that will be important. 

12.2 Remarks, (a) We may restrict the cells (I k ) in Definition 
12.1 to be closed cells if we wish to do so. Indeed, since I k C Iff, it is 
trivial that if (12.2) holds, then we also have E C Iff. Since, as 
was noted in the preceding chapter, we have l (Iff) = 1(h), the value 
of m*(E) is not affected by using the closures of the cells. 

(b) We may also restrict the cells (I k ) to be open cells if we wish 
to do so. Indeed, suppose that m*(E) < +oo and that (12.2) holds 
for a sequence (I k ) of cells. If e > 0, then let (J k ) be a sequence of 
open cells such that 

h C J k and l(J k ) < l(I k ) + e/2 k 

for k = 1, 2, Then we have 

-foo -foo +oo 

£C(Jj*. and ^l(Jfc) < J2l(h) +£■ 
k=l k= l *=i 

Hence it follows (why?) that the same result is obtained in (12.1) by 
using open cells as using arbitrary cells. 
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(c) If desired, we may restrict the cells ( I k ) to be half-open cells, 
having either form. 

(d) If 6 > 0 is given, we may restrict the cells (J fc ) to have 
diameter less than 8. [Recall that if A C R p , then the diameter of 
A is defined to be 

diam(.A) := sup{||x - y|| : x,y £ A}. 

Hence, if x, y £ A, then we have ||a:-y[| < diam( J 4).] To see that this 
is true, note that any half-open cell can be obtained as the union of 
a finite collection of pairwise disjoint half-open cells with diameters 
less than 8. For example, one can successively bisect the sides of the 
cell to obtain cells with diameter less than the preassigned 8. 

We shall now establish the basic properties of the outer measure 
function m*. 

12.3 Theorem. The outer measure function m* defined in 
Definition 12.1 satisfies: 

(i) 0 < m*(E) < Too for all E C R p , and m*(0) = 0; 

(ii) if E C F, then m*(E ) < m*(F); 

(iii) if ( Ek ) is a sequence of subsets of R p , then 

( +oo \ +oo 

<5>w 

fc=l / fc=l 

PROOF, (i) The first property has already been noted. If we 
take I k = 0 for all k € N, then 0 C fj U / 2 U . . . , so 

0 < m*(0) < 0 TO T . . . = 0. 

Therefore (i) is proved. 

(ii) If (h) is a sequence of cells with F C US h, then also 
E C (Jfe “ h- Therefore m*(E) < m*(F), proving (ii). 

(iii) It suffices to prove the assertion in the case that m*(E n ) < 
Too for each n £ N. (Why?) Let s > 0 and, for each n £ N, choose 
a sequence ( I^) k of cells such that 

+oo -foo 

E n c\Jl% and ^ m * ( E n) + s/2 n - 

A=1 k—l 
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Since { J] " : k,ne N} is a countable family of cells that cover the set 
Un=i E n , it follows from the definition of m* that 


( +00 \ +00 -f-OO +oo 

U B » s E wi-EE'M) 

n— 1 / fc,n— 1 n=l k—l 

4-oo 4-oo 

< £(m*(K) + e/2 n ) = £ m*(£ n ) + e. 

n = 1 n = 1 

[Since m*(E) > 0 for any set E C R p , the change from a double 
sum to an iterated sum is justified.] Now, since e > 0 is arbitrary, 
property (iii) is proved. Q.E.D. 

Property (iii) of Theorem 12.3 is referred to by saying that m* 
is countably subadditive on the collection of all subsets of R p . In 
particular, it follows from (iii) that if A and B are disjoint subsets 
(that is, if A fi B = 0), then 

m*(AuB) < m*(A) + m*(B). 

Our intuitive feeling about volume is that equality should hold in this 
relation. Unfortunately, it is not the case (as we will prove in a later 
chapter) that equality holds for an arbitrary pair of disjoint subsets 
of R p . However, we now show that this desired equality relation is 
true in case the sets A and B are at a positive distance from each 
other. 

12.4 Theorem. Let A and B be disjoint subsets of R p with 
dist(^4, B) := inf{||a — 6|| : a e A, b e B} > 0. Then we have 

m*(A U B) = m*(A) + m*{B). 

proof. We have already seen that it is always the case that 
m*(A U B) < m*{A) + m*(B). Therefore (why?), it suffices to prove 
the opposite inequality under the hypothesis that m*(A U B) < +oo 
and 6 dist(^4, B) > 0. 
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If s > 0 is given, let (J n ) be a covering of A U B such that 

-f-oo 

Y. l(In) < m*(AU B) + e. 

71 = 1 

As was noted in Remark 12.2(d), we may assume that the cells ( I n ) 
have diameter less than 6. In this case, no cell I n can contain both 
points in A and points in B, for then we would have dist(A, B) < 6, 
contrary to the hypothesis. Hence we may divide the cells {/„} into 
three classes: (i) the cells {Jj} that contain points in A, (ii) the cells 
{K k } that contain points in B, and (iii) the cells {H h } that do not 
contain points in either A or in B. Therefore, we have 

m*(A)<Y,l(Jj) and m*(B) < 

j k 

from which it follows that 

m*{A) + m*(B) < + £1^) 

j k h 

n 

Therefore, we have m*(A) + m*(B) < m*(A U B) + e. Since e > 0 is 
arbitrary, we infer that m*{A) -f m*(B) <m*(A U B), as was to be 
shown. q.e.d. 

It is conceivable that the value m* gives to a cell might be differ- 
ent from the p-dimensional volume of the cell; however, we will now 
show that such an unfortunate situation cannot occur. 

12.5 Theorem. If I is any cell in R p , then m*(I ) = 1(1). 
PROOF. Since the sequence (1, 0, 0, . . .) is a covering of I, it fol- 
lows that m*(I) < 1(1) + 0 + 0+ . . . = 1(1). To establish the opposite 
inequality, let e > 0 be given and let (/*,) be a covering of I by open 
cells such that 

+oo 

Sl(4)<m*(/) + e. 

k= 1 
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If I is a closed cell, let J := 7; otherwise, let J be a closed cell such 
that J C I and 1(1) — s < l(J). By the Heine-Borel Theorem there 
is an m € N such that J C ur =1 4. 

We now divide the space R p into a finite number of closed inter- 
vals by extending the (p — l)-dimensional hyperplanes that contain 
a face of one of the cells and of J. Let K u ..., K n be 

the distinct closed cells into which the cells are divided 

by these hyperplanes; further, let be the closed cells into 

which J is divided. Therefore, we have 

Z(J)=£Z(4) <£/(#*) 

j=l h — 1 

m 

<£*(/*) <m* (/) + £. 

*=i 

Consequently, 1(1) < l(J) +£ < m*(I) + 2s. Since s > 0 is arbitrary, 
we deduce that 1(1) < m*(I). Therefore it follows that 1(1) = m*(I), 
as claimed. q.e.d. 

TRANSLATION INVARIANCE 

We noted at the end of Chapter 11 that the volume of a cell is 
not changed by a translation. We now show that the analogous result 
holds for the outer measure of an arbitrary subset of R p . 

12.6 Theorem. If E C R p and x e R p , then we have 
m*(x © E) = m*(E). 

PROOF . If the sequence (Ik) of cells is a covering for E, then the 
sequence (x © /*,) of x- translates is a covering of x © E by cells. Since 
we have 1(1 k ) = ^ l( x © I k ), it follows that 



= inf 


= m*(E). 
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Since every covering (J k ) of the set x ® E by cells can be obtained 
by translating a covering (K k ) of E (how?), it also follows that we 
have m*(E) < ro*(x © E), whence m*(E) = m*(x © E). We leave 
the precise details as an exercise. Q.E.D. 



CHAPTER 13 


Measurable Sets 


In the preceding chapter we defined the outer measure m*(E) of 
an arbitrary subset E of R p . While the function m* has the distinct 
advantage of being defined for every subset of R p , it is not always 
additive over disjoint subsets; that is, it does not always satisfy 

m*(AUB) = m*(A) + m*(B) 

when A and B are sets such that A n B = 0. 

In this chapter we will prove that, by restricting m* to a certain 
family £ of subsets of R p , we obtain a function that not only is 
additive over disjoint sets, but is even countably additive in the sense 
that if ( E n ) is a sequence of sets in £ that are pairwise disjoint (i.e., 
E n n E m — 0 for n ^ m), then the union E E n belongs to 

£ and 

+0O 

(13-1) m *( E) = J2m*(E n ). 

n— 1 

This countable additivity property is a very desirable one; in fact, it 
is this property that makes many of the properties of the Lebesgue 
theory of integration work nicely. 
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The main theorem in this chapter is due to the well-known Greco- 
German mathematician Constantin Caratheodory. We will state this 
result for the outer measure m* introduced in Chapter 12, although 
his theorem remains true for any function satisfying the properties 
stated in Theorem 12.3. 

In order to simplify the statement of Caratheodory’s theorem, 
we introduce some terminology that is often useful. 

13.1 Definition. Let X be an arbitrary set. Then a family 
S of subsets of X is said to be a cr-algebra in X if the following 
conditions are satisfied: 

(i) 0 and X belong to E; 

(ii) if E € E, then the complement E c — X -E belongs to E; 

(iii) if (E n ) is a sequence of sets in S, then the union IJn^i 
belongs to E. 

Remarks, (a) If £ is a cr-algebra of subsets of X, then the 
intersection of a sequence of sets in E also belongs to E. 

(b) If X is any set, then Ej := {0, X} is a (rather trivial) 
example of a a- algebra of subsets of X. 

(c) If X is any set and E is a subset of X, then E 2 
{0, E, E c , X} is a cr-algebra of subsets of X. 

(d) If X is any set, then E 3 := {all subsets of X} is a a- 
algebra of subsets of X. 

(e) If X is a set and Ei and E 2 are <r-algebras of X, then 
Ei n E 2 (= the collection of all subsets of X that belong to both Ei 
and E 2 ) is also a cr-algebra of subsets of X. 

13.2 Definition. Let X be a set and let E be a cr-algebra of 
subsets of X. Then an extended real- valued function f-t defined on E 
is said to be a measure on E in case it satisfies: 

(i) m(0)=O; 

(ii) 0 < n(E) < +00 for all E e E; 

(iii) if (E n ) is a sequence of sets in E that are pairwise disjoint, 
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then we have the equation 

( +oo \ +oo 

U Bn = £>(£.)• 

n=l / n=l 

Remarks, (a) Since /i(E) > 0, the series on the right side of 
equation (13.2) is either absolutely convergent, or it is properly di- 
vergent to +oo. 

(b) If X = N and E = {all subsets of X}, then define n(E) to 
be the number of elements in E if E is a finite set, and fi(E) := +oo 

if E is an infinite set. Then p is a measure on E and is called the 

counting measure on N. 

We now introduce the “test for measurability” of a set E C R p . 

13.3 Definition. Let m* be the outer measure defined on 
all subsets of R p . A set E C R p is said to satisfy the Caratheodory 
condition in case 

(13.3) m*(A) = m*(A D J5) + m*(A fl E c ) 

for all A C R p . The collection of all such sets will be denoted by C. 
We note that the condition (13.3) can also be written in the form 

m*(A) = m*(Ar\E) + to* (A - E). 

Intuitively, a set E satisfies the Caratheodory condition if E and its 
complement E c split every set A “additively” . The sets that satisfy 
the condition (13.3) are precisely the desired “measurable sets” . The 
next lemma shows that the task of showing that a set E satisfies 
(13.3) can be simplified somewhat. 

13.4 Lemma. A set E satisfies the Caratheodory condition if 
and only if, for each set A with to* (A) < +oo, then 


(13.4) 


to* (A) > m* (A (1E) + to* (A n E c ). 
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PROOF . Indeed, since AC\E and A D E c are disjoint and have 
union A, it follows from Theorem 12.3(iii) that we always have the 
inequality 

m*(A) < m*(AnE) + m*(AnE c ). 

Hence, if (13.4) is satisfied, then so is (13.3). Finally, we note that 
(13.4) is trivial in case m*{A) = +oo, so it is only necessary to treat 
the case m*(A ) < +oo. Q.E.D. 

We will now obtain Caratheodory’s theorem. 

13.5 Theorem. (Caratheodory) Let m* be the outer measure 
defined in Definition 12.1. Then the set C of all subsets of R p that 
satisfy the Caratheodory Condition 13.3 is a o -algebra of subsets of 
R p . Moreover, the restriction of m* to C. is a measure on C. 

proof. It is clear that 0 satisfies (13.3), and that if E satisfies 
(13.3), then so does its complement E c . Hence the family of sets that 
satisfy the Caratheodory condition satisfies properties (i) and (ii) of 
Definition 13.1. 

We now show that if E and F satisfy (13.3), then so does EnF. 
For, since E € £, then 

m*{A) = m*(A D E) + m*(A n E c ) 
for any A C RP. Since F e £ we have 

m* {A n E) = m* ( A n E n F) + m* (A n E n F c ), 
whence it follows that 

m*(A) = m*(An E n F) + m*(An E n F c ) + m*(An E c ). 

But since E € C, we also have 

m*(An(Er\F) c ) 

= m* ( A n {E n F) c n E) + m* (A n (E n F) c n E c ) 

= m*(^4 n F c n E) + m*(A n E c ). 


s \ C: 
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It therefore follows that 

m* (4) = to* ( A n (F n F) ) + m* ( A n (E n F) c ) 

for all sets A. Consequently, E(~\F belongs to £. 

Since £ contains the complements of sets in £, it follows from 
De Morgan’s Laws that if E, F € £, then E U F € £. Moreover, if 
BnF = 0, then it follows from the fact that E satisfies (13.3) with 
A replaced bydn(SuF) and F = F fl E c that 

TO* (A n (BU F)) = TO* (A n (E U F) n E) + m*(A n(£uF)n E c ) 
= to* (A n E) + to* (A n F). 

By induction we conclude that if E x ,...,E n belong to £ and are 
pairwise disjoint, then E x U . . . U E n belongs to £ and 

to* (A n (E\ U . . . U En)) = m’(An£i) + ... + m*(An E n ) 


for all AC R p . 

We now show that £ is a c-algebra and that m* is countably 
additive on £. To do this, let ( E n ) be a pairwise disjoint sequence in 
£ and let E := E k . We know that F n := (J£ =1 E k belongs to £ 
for all neN. Moreover, if A C R p , then 

m* (A) = m*(AnF n ) + m*(AnFZ) 

= m * An^j +m*(Anf n c ) 

n 

= 53 m *(^ n E k) + m *(A n F®). 

*=i 

Since F n C E, then A n F® D A n F c for all n e JV, so that 


w* (A) > 53 m *( yine *) + TO*(AnF®), 

fc=i 
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which implies that 

+oo 

(13.5) to* (A) > J2m*(AnE k ) + m*(AnE c ). 

k—l 

On the other hand, it follows from the countable subadditivity of m* 
that 

( +oo \ +oo 

<J2m*(AnE k ). 

k= 1 / k = 1 


Therefore, we have 

m*(A) > m*(Ar\E) + m*(AnE c ), 

which (in view of Lemma 13.4) implies that E e C, so that £ is a 
<r-algebra. In addition, if we take A — E in (13.5) and (13.6), we 
obtain 

+oo 

m\E) = Y,rn*{E k ), 

k= 1 

which shows that m* is countably additive on C. Q.E.D. 

13.6 Definition. If m* is the outer measure defined in Def- 
inition 12.1, then the cr-algebra C of subsets of R p that satisfy the 
Caratheodory Condition 13.3 is called the Lebesgue ct- algebra of 
R p . A set E € C is called a Lebesgue measurable subset of R p , 
or briefly, a measurable subset of R p . The restriction m of m* to 
L is called Lebesgue measure on RP . 

Since m is the restriction of m* to the a- algebra C, we have 
m(E) = m*(E) for all E £ C. Ordinarily, when we know that a set 
E is measurable, we will write m{E) instead of m*{E). 

The next result assures that cells in RP are measurable sets and 
that their Lebesgue measure coincides with their volume. 

13.7 Theorem. If I is a cell in RP , then I is measurable, and 
hence m(I) = 1(1). 
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PROOF. We will give the proof for an open cell, leaving to the 
reader to extend the conclusion to an arbitrary cell in R p . It was 
seen in Lemma 13.4 that it suffices to show that if A C R p is such 
that m*{A) < +oo, then 

m*(A) >m*(Ar\I) + m*(A — I). 

Let n e N and let I n {x e I : dist(x, I c ) > 1/n}, so that I n C I. 
Moreover, since I — /„ is contained in the union of 2 p cells each of 
which has one side with length 1/n, then m*{I-I n ) — > 0 as n — > +oo. 

Note that A D (A n /„) U (^4 - 1) and dist(j4 n A - 1) > \/n. 
Therefore, it follows from Theorem 12.4 that 

m*(^)>m*((^n7 n )U(i4-7)) 

1 ' = m*(AnI n ) + m*(A-I). 


On the other hand, since 

Anl = (Anl n )u (An(l-i n )), 

it follows from the subadditivity and the monotone character of m* 
that 

m*(An.I n ) <m*(AnI) <m*(AnI n ) + m*(I -I n ). 

Therefore, we have 


m*(An I) = lim m*(Anl n ). 

n— > oo 

Hence, taking the limit in (13.7), we have 

m*(A) >m*(Anl) + m*(A — 7), 

which shows that I is a measurable set, by Lemma 13.4. Q.E.D. 

We have been able to obtain a measure defined on a a-algebra C 
of sets that agrees with the volume function l, originally defined only 
for cells. Consequently, we have been successful in extending l to a 
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larger collection of sets, and we will soon see that the collection £ is 
a very large collection of sets. It is conceivable, however, that there 
may be another measure defined on £ that agrees with l on cells. We 
now show that this is not the case. 

13.8 Theorem. If p is a measure defined on £ that is such 
that p(I) = 1(1) for all open cells I C R p , then p = m. 

PROOF. For n € TV, let I n be the open cell 
I n := (— n,n) x . . . x (- n,n ). 

Let 17 € £ be any set with E C I n and let ( J*,) be a sequence of open 
cells such that E C (J^Ti Jk- Since pis & measure and p(Jk) = l(Jk) 
for all k e TV, we have 


( +oo \ +oo +oo 

[jJk <£>(J*) = £w 
fc=l J *=1 Jfe=l 

Therefore, we have p(E) < m*(E) = m(E) for all measurable sets 
E C I n . Since p and m are additive, then 

j 

p(E) + p(I n ~E) = p(I n ) = m(I n ) = m(E) + m(I n - E). 

Since all of these terms are finite and p(E) < m(E) &nxTp(I n - E) < 
m(I n — E), it follows that p(E) — m(E) for all measurable sets 
ECI n . 

Now an arbitrary measurable set E can be written as the union 
of a disjoint sequence (E n ) of sets, defined by 

I7i := Bnlj, £„:=£(!(/„-/„_!) for n> 1. 

Since p(E n ) — m(E n ) for all n e IV, it follows that 

+00 -f-OO 

p(E) = £>(£«) = £ m (E„) = m(E). 


Thus p and m agree on all measurable sets. 


Q.E.D. 
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We conclude this chapter with two useful results that are simple 
properties of the positivity and the countable additivity of Lebesgue 
measure. 

13.9 Theorem. If E and F are Lebesgue measurable sets and 
if E C F, then m(E) < m(F). If, in addition, m(E ) < +oo, then 
m(F — E) = m{F) — m{E). 

PROOF. Since m is additive, it is immediate from the fact that 
F = E U (F - E) and E n (F - E) = 0 that 


m(F) = m(E) + m(F - E). 

Since m(F - E) > 0, we have m(F) > m(E). If m(E) < +oo, then 
we can subtract m(E) from both sides of the above equation. Q.E.D. 

13.10 Theorem, (a) If ( Ek ) is an increasing sequence of 
measurable sets, then 

(l 3 - 8 ) ml M E k ) = lim m(E k ). 

\*=i . / ^°° 

(b) If ( l <\ ) is a decreasing sequence of Lebesgue measurable sets 
and ifm(F\) < +oo, then 


(13.9) 


m 


+oo 

rin 

. k=l 


lim m{F k ). 

K—> OO 


PROOF, (a) If m{E k ) — +OC' for some k € N, then both sides of 
(13.8) are equal to +oo. Hence we may suppose that m(E k ) < +oo 
for all k € N. Now let A 1 := Ex and A k := E k - E k -i for k > 1. 
Then (yl*,) is a disjoint sequence of measurable sets such that 

k +oo +oo 

E * = U A 3 and U Ek = U Ak - 

i= 1 *=i k=i 
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Since m is countably additive, then 


+oo 

u 

.fc=l 


+0O / n \ 

= = j 1 ™, ( ) • 


,)«= i 


By Theorem 13.9, we have m(Ak) = m(Ek) - m(Ek-i) for k > 1, so 
the finite sum telescopes and 

n 

^2m(A k ) = m{E n ). 

k = 1 

Hence equation (13.8) is proved. 

(b) Let Ek := Fi - Ffc for fc e N, so that (Ek) is an increasing 
sequence of measurable sets. If we apply part (a) and Theorem 13.9, 
we infer that 


+oo 

ml I J Ek ] = lim m(E n ) — lim [m(Fi) - m(F„)l 

\ V-/ / n— ► oo n—* oo L J 

\fc=l / 

= m(Fi) - lim m(F„). 


But since (JiS Ffc = Fi — ("|£“ F*, it follows from Theorem 13.9 
that 

to ^ E k ^ = to(Fi) - to ^ P) F k J . 

We now combine the last two relations to obtain (13.9). 


Q.E.D. 



CHAPTER 14 


Examples of Measurable Sets 


We have established, in the preceding chapter, that there exists 
a cr-algebra C of subsets, called the Lebesgue measurable sets, 
containing all cells in R p , and on which there is a measure function, 
called Lebesgue measure, that extends to the sets in £ the notion 
of “volume”. It was asserted that the set £ of measurable sets is 
a very large one, but this assertion has not yet been proved. The 
purpose of this chapter is to give some indications about what sets 
belong to this collection. We will show that all of the familiar sets in 
analysis are Lebesgue measurable. 

14.1 Lemma. Every open subset of R p is the union of a count- 
able collection of open cells. 

proof. Let Q denote the set of all rational numbers in R and 
let Q p := Q x ... x Q denote the collection of those points in R p all 
of whose coordinates are rational numbers. It is well known that Q p 
is a countable set; we let ( z n : n s N ) denote an enumeration of Q p . 

If G is an arbitrary open set in R p , for each z € G n Q p , let 
C{z, 1 /n z ) be the open cube (i.e., the cell all of whose sides are equal) 
with center z and side length l/n 2 , where n z is the smallest natural 
number such that C(z, 1 /n z ) is contained in G. (We have just used 
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the Well-ordering Property of N .) We now let \ '. J 

Go := jJ{G( z , l/«») : « € G n Q p ) \ 

so that Go is a union of a countable collection of open cubes. There- 
fore Go is open and Go Q G. ^ T f 

We now show that Go = G. Indeed, if y e G is arbitrary, then 
since G is open, there exists a cube G(y, 1 /n y ) with center y such 
that C(y, 1/riy) C G. Now consider the cube C(y, l/2n y ) whose side 
length is half that of C(y, 1 /n y ). Since Q is dense in R, then the set 
Q p is dense in R p , and there are infinitely many “rational” points in 
G rC(y,l/2n y ). Let w be the first such rational point, according to 
the enumeration of Q p selected above. Since w 6 G,h G(w, l/2n y ), 
it is an exercise to show that y e C{w,\/2n y ) C C(w, l/n w ) C Go- 

We conclude that G = Go, so that G is the union of a countable 
collection of open cubes, which are open cells. Q.E.D. 

14.2 Theorem. Every open and every closed subset of R p 
is Lebesgue measurable. 

PROOF. It follows from Lemma 14.1 and the fact that the collec- 
tion £ possesses property 13.1(iii), that every open set belongs to C. 
Since a closed subset of R p is (by definition) the complement of an 
open set in R p , every closed set is also Lebesgue measurable. Q.E.D. 

The intersection of a sequence (or countable collection) of open 
sets is often called a Gg- set. It is well known that such a set is not 
necessarily an open set (give an example!); however, it is a measur- 
able set, since the intersection of a sequence of sets in a a-algebra 
also belongs to the <r-algebra. Similarly, the union of a sequence (or 
countable collection) of closed sets is often called an F a -set. Such a 
set is not necessarily a closed set (example, please), but it is a mea- 
surable set. Further, a set that is the union of a sequence of Gg-sets is 
often called a G« CT -set; it is always a measurable set. Also, a set that 
is the intersection of a sequence of F^-sets is often called an F^^-set; 
it is also measurable. If we continue in this way, we can define sets 
that are 


Gs<t6i Ggfjgct R &6a6i 
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sets. All the sets that are obtained in this way are Lebesgue measur- 
able sets, so it is very difficult even to imagine a subset of R p that 
is not Lebesgue measurable. However, we will give an example in 
Chapter 17 of a subset of R p that is not Lebesgue measurable. 

BOREL SETS 

In addition to the <r-algebra £ of Lebesgue measurable sets, it 
is often convenient to work with a somewhat smaller a-algebra of 
subsets of R p . 

14.3 Definition. The smallest u-algebra of subsets of R p 
that contains all of the open sets is called the Borel <7- algebra and 
will be denoted by B. Any set in B is called a Borel set. 

A word needs to be said about the existence of the “smallest” 
<7-algebra containing the open sets. As we have noted in Remark 
(e) after Definition 13.1, the intersection of two a-algebras is again 
a <r-algebra; moreover, it is an exercise to show that the intersection 
of an arbitrary collection of cr-algebras is a a-algebra. Consequently, 
it follows from this observation that the intersection of all <r-algebras 
containing the open sets in RP is a rr-algebra that contains the open 
sets; hence it is precisely the collection of Borel sets. 

In addition to all open sets, and all closed sets, the collection B 
must also contain all G s -, all F a -, all G s<r -, all F aS -, . . . sets. On the 
other hand, since £ is a a-algebra containing all open sets, it follows 
that we must have 

BCC. 

The question naturally arises as to whether we might have equality 
in this inclusion; that is, whether every Lebesgue measurable set is a 
Borel set. The answer is: No. We will sketch a proof of this assertion 
at the end of this chapter. 

NULL SETS 

We now introduce a class of sets that are small (at least from 
the point of view of measure theory) but which often play a very 
important role. 
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14.4 Definition. A subset E C R p with m*(E ) = 0 is called 
a (Lebesgue) null set. 

Of course, the empty set 0 is a null set, as is a set consisting 
of a finite number of points. Indeed, any countable subset Z := 
{PuP 2 ,---} of R p is a null set; for, given any e > 0, let the point 
Pi be enclosed in a cell Ii with m(Ji) < e/2, let p 2 bo enclosed in 
a cell I 2 with m(/ 2 ) < e/2 2 , . .., let p n be enclosed in a cell I n with 
m(/ n ) < e/2 n , . . .. Hence, it follows from the countable subadditivity 
of m (which is a consequence of this same property of m*) that 

+ OO +OO 

0 < m(Z) < £ m(J„) < £e/2" = e. 

n — 1 n= 1 

We now show that any null set, and hence any subset of a null set, is 
a Lebesgue measurable set. 

14.5 Theorem. If Z C R p is a null set, then Z is a Lebesgue 
measurable set and hence m{Z) = 0. Moreover, any subset of Z is 
Lebesgue measurable and a null set. 

PROOF. Let A C R p be arbitrary. Then since Z D A D Z and 
A D A fl Z c , it follows from the monotone property of m* that 

m*(A) = m*{Z) + to* (A) > m*{A n Z) + m*(A n Z c ). 

Therefore, it is a consequence of Lemma 13.4 that Z is Lebesgue 
measurable; hence m(Z) = m*(Z) — 0. 

If W C Z, then 0 < m*(W) < m*(Z) = 0, whence W is also a 
null set, and therefore is Lebesgue measurable. Q.E.D. 

One sometimes says that Lebesgue measure is complete, mean- 
ing that any subset of a Lebesgue null set is measurable, and hence 
has Lebesgue measure equal to 0. 

Normally, we think of null sets as not having many points; how- 
ever, that is not necessarily the case, as we will now show. 

14.6 COROLLARY. There are Lebesgue null subsets of R p that 
contain uncountably many points. 
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proof. We will prove the assertion for the case p= 1 , and leave 
it to the reader to extend the result to an arbitrary value of p. 

We note that the familiar Cantor set [see pp. 351-353, Introduc- 
tion to Real Analysis, Second edition, John Wiley & Sons, 1992, by 
R. G. Bartle and D. R. Sherbert] is a closed subset of R. Further, 
the argument presented there shows that, given s > 0, the Cantor 
set can be enclosed in the union of countably many cells whose total 
length is less than e. Therefore, the Cantor set is a Lebesgue null set. 
However, it was also seen in the reference cited that the Cantor set 
contains uncountably many points. Q.E.D. 

TRANSLATION INVARIANCE 

The next result can be summarized by saying “Lebesgue measure 
is translation invariant” . 

14.7 Theorem. If E C R p is Lebesgue measurable and x e 
R p , then x® E is Lebesgue measurable and 

m(x®E) = m(E). 

proof. If A and B are arbitrary subsets of R p and if 2 g R T \ 
then it is an exercise to show that ye (z © A) n B if and only if 
y e z 0 (A fi [(— 2 ) © B)). Hence we have 

(z©A)nR = z© (An[(-z )0 J 5 ]). 

Similarly, one shows that x®B c = (x®B) c . Now let x, z be related by 
x ~ ~ z l then it follows from the invariance of m* under translation 
that 

(14.1) m* ((z © A) D £?) = m* (A fl (a; © B)) . 

Now let E e C and use (14.1) with B = E and B = E c to obtain 

to* (A) = m*(z © A) = to* ((z © A) n E) + m* ((z © A) n E c ) 

= m* {A fi (x © E)) + to* (A (1 (a: © E c j) 

= m*(An(x®E))+m*(Ar\(x®E) c ). 



154 The Elements of Lebesgue Measure 

for all A C R p . Therefore a; © E is also Lebesgue measurable and, 
from Theorem 12.6, it has Lebesgue measure equal to 

m*(x© E) = m*(E) = m(E). Q.E.D. 


NON-BOREL SETS 

We now return to the question of whether there exists a Lebesgue 
measurable set that is not a Borel set. We will state the result for- 
mally in the case that p = 1, but give only an outline of the argument, 
which is based on cardinal numbers. A reader not familiar with car- 
dinal numbers will do well to accept the validity of the assertion on 
faith until Chapter 17, where a more “constructive” proof is given. 

14.8 Theorem. In the space R there exist Lebesgue measur- 
able sets that are not Borel sets. ^ ^ 

SKETCH OF proof. We first notice that there is a countable 
number of open cells with rational endpoints; that is, the cardinality 
of the collection of all of these “rational cells” is equal to the cardinal 
number No- It is an exercise to show that B is the smallest a-algebra 
containing all of these “rational cells” . Therefore, it follows that the 
cardinal number of the set B is 

(No)* 0 = c, 

where c denotes the cardinality of the set R of all real numbers. On 
the other hand, it was seen in Corollary 14.6 that C contains a null 
set with uncountably many elements; in fact, the Cantor set can be 
seen to have c elements. Since an arbitrary subset of a null set is a 
Lebesgue measurable set, then C contains at least 2 C sets. But since 
R has only 2° subsets, it follows that the cardinality card(£) = 2 C . 
Therefore, we have 


card(B) — c < 2 C = eard(£), 


whence B is properly contained in C. 
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Approximation of Measurable Sets 


We now show that arbitrary Lebesgue measurable sets are pre- 
cisely those sets that can be approximated by open and closed sets. 
In fact, it would be possible to take these characterizations as the def- 
inition of measurability. However, we have chosen not to do so, since 
in abstract measure spaces there is no notion of “open” or “closed”, 
so the Caratheodory condition is the only method that is possible of 
determining measurability. 

APPROXIMATION BY OPEN SETS 

First we will show that every subset of R p can be enclosed in a 
Gg- set with the same outer measure. 

15.1 Lemma, (a) If A C R p and e > 0, then there exists an 
open set G C R p such that A C G and m(G ) < m*(A ) + e. Hence 

(15.1) m*(A ) = inf {m(G) : A C G, G open}. 

(b) If A C R p , then there is a Gg-set H such that A C H and 
m*(A) = m(H). 

PROOF, (a) We may assume that m*(A) < +oo. (Why?) It was 
noted in Remark 12.2(b) that we may require the cells in Definition 
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12.1 to be open. Thus there exists a sequence ( I fc ) of open cells 
covering the set A such that 

+oo 

Y,l(h)<m*(A)+e. 

fc= 1 

If we let G := h, then G is open and, by the countable subad- 
ditivity of m* and Theorem 12.5, we have that 


+oo +oo 

m(G) < £m*(Jfc) = £/(/*) < m*(A) + e. 
fc=i fc=i 

Equation (15.1) now follows from the definition of the infimum. 

(b) For each n € N, let G n be an open set such that A C 
G n and m{G n ) < rn*(A) + 1/n. Now let H := £» n , so that 

A C H C G n and m*(A) < m(H) < m*(A) + 1/n for all n € N. 
Therefore m*(A) = m(H ), as asserted. Q.E.D. 

15.2 Corollary. Every Lebesgue null set is a subset of a Borel 
null set. 

proof. If Z is a Lebesgue null set, there is a Gg- set H such that 
Z C H and m{H) = 0. But H is a Borel set. Q.E.D. 

Unfortunately, in Lemma 15.1(b), the difference H — A need not 
be a “small” set. In fact, it will be seen in Corollary 15.5 that the set 
A is Lebesgue measurable if and only if the set H — A is a null set. 

15.3 Theorem. A set E C R p is Lebesgue measurable if and 
only if for every e > 0 there exists an open set G with E C G and 
m*(G — E) < e. 

proof. We first assume that E is measurable and that m(E) < 
+oo. Then, by Lemma 15.1(a), there exists an open set G such that 
E C G and m(G) < m(E) + s. Since E is measurable and E C G, 
we have 


m(G) — m(G (1 E) + m(G — E) — m(E) + m(G — E). 
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Since m(E) < +00, we have 

m(G -E) = m(G) - m(E) < e. 

If m(E) = +oo, let Ei := E fl {x : ||x|| < 1} and, if n > 2, let 
E n := E D {x : n — 1 < ||x|| < n}. For n £ N, let G n be an open 
set with E n C G n and m(G n - E n ) < e/2 n . If we let G := Un^i 
then G is open, BCG, and 

-f-oo 

G-E C |J (G n - E n ). 

n= 1 

Therefore, from the countable subadditivity of m*, we have 

+oo +oo 

m(G -E)<^2 m(G n - E n ) < ^ e/2 n = e. 

n = 1 n=l 

Conversely, suppose that for every n € N there exists an open 
set G n D E such that m*(G n -E) < 1/n. Let H f|^ G n so that 
H is a G , 5 -set (and hence is measurable). Moreover, since H C G n , 
we have H - E C G n - E and hence 

0 < m*{H - E) < m*(G n - E) < 1/n 

for all n e N . Therefore m*(H — E) = 0, which implies that Z := 

H — E is a measurable set. Therefore, E — H - Z is a measurable 
set. Q.E.D. 

15.4 Corollary. If E C R p is measurable, then for any £ > 0 
there exists an open set G D E with m(G) < m(E) + s. f Therefore , 
we have X Xl l *■ - • / 

m{E) = inf{m(G) : G open, G D E}. 0 

PROOF. Indeed, from Theorem 15.3, we have m(G) = m(E) + 
m(G — E) < m(E) + e. Q.E.D. 

The next result is concerned with approximation of a set by a 
G^-set from the outside. It is a useful characterization of Lebesgue 
measurability. 
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15.5 Corollary. The following statements are equivalent: 

(a) The set E C R p is Lebesgue measurable; 

(b) there exists a Gg-set H with E C H and m* (H-E) = 0; 

(c) there exist a Gg-set H and a Lebesgue null set Z such 
that E C H, Z C H, and E = H - Z. 

We leave it as an exercise to write out the details of the proof. 
APPROXIMATION BY CLOSED SETS 

We now show that Lebesgue measurable sets are those sets that 
can be well approximated from inside by closed sets. 

15.6 Theorem. A set E C R p is Lebesgue measurable if and 
only if for each e > 0 there exists a closed set F with F C E and 
m*(E - F) < e. 

proof. If E is measurable, then its complement E c is also mea- 
surable. By Theorem 15.3 there exists an open set G with E c C G 
and m(G — E c ) < s. Now let F := G c so that F is closed, F C E 
and E — F = EP\G — G — E c , whence m(E - F) = m{G — E c ) < e. 

Conversely, for each n £ N there is a closed set F n C E with 
m*(E — F n ) < 1/n, and we let K := F n . Then AT is a F„- set 
(and is therefore Lebesgue measurable) and, since F n C K, we have 
E — K C E — F n , so that m*(E — K) < m*(E — F n ) < 1/n for all 
n. Therefore m* (E — K) = 0 which implies that Z := E — K is a 
measurable set. Therefore E = K U Z is measurable. Q.E.D. 

15.7 Corollary. If E C R p is Lebesgue measurable, then for 
any e > 0 there exists a closed set F C E with m(E) < m(F) + s. 
Therefore, we have 

m(E) = sup{m(F) : F closed, F C E}. 

PROOF. The set F in the theorem is measurable and F C E; 
hence m(E) = m*(EC\F) + m*(E — F) < m(F) + e. Q.E.D. 

The next corollary characterizes measurability in terms of the 
approximation by F„- sets from inside. 
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15.8 Corollary. The following statements are equivalent: 

(a) The set E C R p is Lebesgue measurable; 

(b) there exists an F^-set K with K C E and m*(E-K) — 0; 

(c) there exist an F a -set K and a Lebesgue null set Z such 
that K C E, Z C E, and E = Kli Z. 

We leave it as an exercise to write out the details of the proof. 

APPROXIMATION BY COMPACT SETS 

If E is a Lebesgue measurable set with m{E) < +oo, then we 
can approximate it from within by compact sets, and conversely. We 
recall that a compact set always has finite Lebesgue measure. 

15.9 Theorem. A set E C R p with m*(E) < -foo is Lebesgue 
measurable if and only if, for every t > 0 there exists a compact set 
C with C C E and m*(E — C)<e. 

PROOF. If E is measurable and ne N, let E n be the set defined 
by E n := E C\ {x : ||x|| < n}. Since the sequence (E n ) increases to 
E, it follows that the numerical sequence ( m(E n )) also increases to 
m(E) < +oo, so there is an n 0 such that m(E) < m(E no ) + e/2. By 
Theorem 15.6 there is a closed set C with C C E no and m{E no -C) < 
e/2. Since E is the union of the disjoint sets E - E no and E no , it 
follows that 

m(E) = m(E-E no )+m(E no ) 
and since m(E) < +oo, that 

m(E - E no ) = m{E) - m(E no ) < e/2. 

In addition, E — C is the union of the disjoint sets E — E no and 
E no “ C- Therefore 

m(E -C) = m(E - E no ) + m(E no - C) < e. 

Since C C E no is closed and bounded, it is compact in R p . 

Conversely, suppose that for every n G N there is a compact set 
C n with C n C E and m*(E - C n ) < 1/n. If we set C := (J+~ C n , 



160 The Elements of Lebesgue Measure 

then C is measurable and E - C C E - C n for all natural numbers 
ne N, and it follows that m*(E-C) = 0. Therefore Z E-C is a 
Lebesgue null set and hence is measurable. Consequently, E = CU Z 
is Lebesgue measurable. Q.E.D. 

APPROXIMATION BY CELLS 

We now show that a set with finite measure can be approximated 
by a finite union of bounded cells. We recall that the symmetric 
difference of two sets A, B is the set AAB (A-B)U(B-A). 

15.10 Theorem. If E € C has Unite measure and e > 0, then 
there exist bounded open cells I u ...,I n such that if K := |J”=i 
then m(EAK) < e. 

proof . As in the proof of Lemma 15.1, there exists a sequence 
(fi)Z^i of bounded open cells covering E such that if G := U^°i 
then m(G) < m(E) + e/2. Similarly, by Theorem 15.9, there exists 
a compact set C C E such that m(E - C) < e/2. It follows from the 
Heine-Borel Theorem that a finite number of the cells, say 1\ 
cover C. If K := (J” =1 I iy then since C C K C G and C C E C G, it 
follows that 


m(EAK) = m(E — K) + m(K — E) 

<m(E -C) + m(G - E) <e. Q.E.D. 


It is left as an exercise to show that the cells Ji, . . . , /„ can be 
chosen to be closed, or half-open, or pairwise disjoint. 



CHAPTER 16 


Additivity and Nonadditivity 


In this brief chapter we establish the surprising fact that the 
outer measure function m* is additive over the union of two disjoint 
sets provided that one of them is measurable. We will also give some 
other results concerning the additivity and nonadditivity properties 
of m*. In addition, we will show that if a set is known to be contained 
in a measurable set with finite measure, then the Caratheodory con- 
dition can be replaced by a single test, and relate this result with the 
notion of “inner measure” . 

16.1 Theorem. Let E be a Lebesgue measurable subset of R p 
and let F be any subset of R p . Then: 

(a) to*(I?U F) + m*(E nF) = m(E) + m*(F); 

(b) if £nF = 0, then m* (E U F) = m(E) + m* (F); 

(c) if m(E) < +oo and E C F, then we have m* (F — E) — 
m*{F) — m(E). 

proof . Since E & C, it follows from the Caratheodory condi- 
tion that m* (A) = to* (A n E) + to* (A - E) for all A C R p . If we 
take A := E U F, we obtain 


m*{EUF)=m*((EuF)nE) + m*((EUF)-E) 
= m(E) + m*(F-E), 
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and if we take A F, we obtain 

m*(F) = m*(F n E) + m*(F - E). 

Therefore, we have 

m*{E\JF) + m*(EnF) = [m(E) + m* (F - E)] + m* (E n F) 

= m(E) + [m*(F - E) + m*(E n F)\ 

= m(E) + m*(F), 

as asserted. 

(b) If E fl F = 0, then m* (EOF) = 0, so the conclusion is 
immediate. 

(c) Let H := F - E, so that F = E U H and E n H = 0. 
Hence, from (b) we have 

m*(F) = = m{E) + m*{H) 

= m{E) +m*(F - E). 

Since m(E) < +oo, we know that m*(F) and m*{F - E) are either 
both +oo or both finite. Hence the assertion follows. Q.E.D. 

We saw in Lemma 15.1(b) that for any set E C R p there exists 
a G«-set H such that E C H and m*{E) — m(H). It also follows 
that E is Lebesgue measurable if and only if H - E is a null set. The 
next theorem is somewhat remarkable in that a similar approximation 
from within guarantees the measurability of a set E with finite outer 
measure merely from the equality of the outer measures of the sets. 

16.2 Theorem. Ifm*(E ) < +oo, then E is measurable if and 
only if there is a measurable set B C E with m(B) = m*(E). 

PROOF. If E is measurable, then we can obviously take B := E. 
On the other hand, if B € C, B C E, and m{B) = then 

it follows from Theorem 16.1(c) that 


m*{E - B) = m*(E) - m(B) = 0. 
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Consequently, the null set E—B is Lebesgue measurable and therefore 
E = (E - B) U B is also measurable. q.e.d. 

CARATHEODORY REVISITED 

We now obtain a modification of the Caratheodory condition 
that involves only testing with a single measurable set of finite mea- 
sure that contains the given set. 

16.3 Theorem. Let A C R F be Lebesgue measurable with 
m(A) < +oo. Then E C A is Lebesgue measurable if and only if 

(16.1) m{A) = m*(E) + m*(A — E). 

Proof. If E is measurable, the assertion follows immediately 
from the Caratheodory condition. 

Conversely, by Theorem 15.1(b) applied to the set A — E, there 
is a G^-set H with A - E C H and m*(A - E) — m(H). Since 
A — E C An H C H, it follows that 

to* (A — E) <m(AnH) < m(H) = m*(A — E), 

whence m(A n II) = m*(A — E). But since An il is measurable and 

AD(AnH) = AnH and A - (An H) = A - H, 

we conclude that 

m(A) = m(A n H ) + m(A -H)=m*(A-E) + m(A - H). 

If we use equation (16.1), we deduce that 

m(A - H) = m*(E). 

But since B := A - H C E, it follows from Theorem 16.2 that E is 
Lebesgue measurable, as claimed. Q.E.D. 

It is often the case that we are concerned with sets that are 
contained in some large cell, such as 


J„ := \-n,n ] x ... x \-n,n\, 
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for some n € N . The preceding theorem is useful in establishing the 
measurability of a set E C J n . 

16.4 Corollary. A set E C J n is Lebesgue measurable if and 
only if 

(16.2) m(J n ) = m*(E) + m*(J n -E). 

PROOF. This follows immediately from Theorem 16.3 and the 
fact that J n is measurable (Theorem 13.7). Q.E.D. 

For an unbounded set E C R p , the next result is useful. 

16.5 Theorem. A set E C R p is Lebesgue measurable if and 
only if the sets E fl J n are measurable for each n e N. 

proof. If E is measurable, then the result is trivial. 

Conversely, if each set E n := EnJ n is measurable, then it follows 
from the fact that E = (J^~i E n , that E is measurable. Q.E.D. 

INNER MEASURE 

Readers may initially have been surprised that we have focused 
our attention on outer measure and have not defined a notion of the 
“inner measure” of a set by inscribing a collection of cells inside a 
given set and taking the supremum of the resulting inner approxima- 
tions. One of the reasons we have not done so is that, in general, a 
measurable set — even one with positive measure — may not con- 
tain any cells with positive volume. (Construct an example, please.) 
However, there is a way around this difficulty for a set E that is con- 
tained in a cell J n . Namely, we define the inner measure m*(E) of 
E to be the difference 

m*{E) := m(J„) - m*(J n - E ). 

With this definition, Corollary 16.4 takes the form: A set E C J n 
is Lebesgue measurable if and only if its inner measure and its outer 
measure are equal. This observation is often used in the process of 
defining Lebesgue measure in the interval [0, 1], 



CHAPTER 17 

N onmeasurable and Non-Borel sets 


In this chapter we will establish the existence of a set in R p that 
is not measurable. In order to do so, we will need to use the “Axiom of 
Choice” . The first person to give an example of a nonmeasurable set 
was Giuseppe Vitali, in 1905. In 1970, R. M. Solovay showed that the 
use of the Axiom of Choice is essential, in a certain technical sense. 
Although other constructions of nonmeasurable sets have been given, 
the author is not aware of any that are simpler than Vitali ’s example. 
We will also obtain some other results about the nonadditivity that 
is characteristic of nonmeasurable sets, and establish the existence 
of a nonmeasurable set such that neither the set nor its complement 
contains any measurable sets that are not null sets. At the end of this 
chapter we give a proof of the existence of a Lebesgue measurable set 
in R that is not a Borel set. 

The following definition will be useful. 

17.1 Definition. If A C R p , then its difference set is defined 
to be 

A © A := {x ■ — y : x, y € A}. 

It is trivial, but useful, to observe that if A C B, then AqA C BqB. 
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17.2 Lemma. Let K C R p be a compact set with m(K) > 0. 
Then the difference set K © K contains an open ball with center at 
the origin of R p . 

proof. Since 0 < m(K) < +oo, there exists an open set G 
such that K C G and m(G) < 2 m{K). Since K is compact and 
G c = R p — G is closed, we conclude that 

S := dist^, G c ) > 0. 

This implies that if ||x|| = dist(ar,0) < S, then x © K C G. 

We claim that (x © K) fl K ^ 0. For, if not, then since we have 
K U (x © K) C G, it follows from ( x © K) C\K — 0 and the additivity 
of m that 


2 m(K) = m(K) + m(x © K) = m(K U (a; © K )) 

< m{G) < 2m(K), 

which is a contradiction. Therefore (x © K) n K ■£ 0 for all x with 
INI < S. But this implies that if ||x|| < 6 , then there exist k x , k 2 € K 
such that x = k x -k 2 EKeK. Therefore the set K © K contains 
the open ball with center 0 and radius 8. Q.E.D. 

17.3 Theorem. If E C R p is any Lebesgue measurable set 
with m(E) > 0, then the difference set E © E contains an open ball 
with center 0. 

PROOF. For n€N, let E n :={xeE: ||x|| < »}. Since m(E) = 
lim n rn(E n ), we have m(E n ) > 0 for n sufficiently large, say for all 
n > n 0 . We note that 0 < m{E no ) < +oo. By Theorem 15.9 there 
exists a compact set K C E no C E with 0 < (1/2 )m(E no ) < m(K). 
Since K C E, it is clear that K © K C E © E. By the preceding 
lemma, we conclude that K © K contains an open ball with center 0; 
therefore, so does EQ E. q.e.d. 

17.4 DEFINITION. If X, y e R p , we say that X is rationally 
equivalent to y and write x~y if x-ye Q p \ that is, if the 
components -yi £ Q for ali i = 1,2, ... ,p. 
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It is easy to see that rational equivalence is an equivalence rela- 
tion on the set R p [in the sense that (i) x ~ x; (ii) a; ~ y if and only 
if y ~ x; and (iii) x ~ y and y ~ 2 imply that x ~ z). Hence rational 
equivalence on R p divides R p into a collection of disjoint equivalence 
classes. Using the Axiom of Choice, we form a set V by choosing one 
representative from each equivalence class, so that if v\,V 2 6 V and 
v\ ~ V 2 , then v\ = n 2 . We refer to any such set V as Vitali’s set. 
Moreover, if q £ Q p , we write 

(17.1) V q := q® V. 

It is easy to see that if q, q' e Q p , q ± q', then V, n V q ' = 0. 

17.5 Lemma. Let < 71 , q 2 ,... be an enumeration of the countable 
set Q p . Then R p can be represented by the disjoint union: 


+00 +00 

(17.2) R p =\JV qi = {J( qi ®V). 

i=l i= 1 

Therefore, every element x e R p has a unique representation in the 
form x — q { + v for some q t and some v € V. 

proof. If x € R p , then x belongs to a unique rational equiva- 
lence class. If v is the representative of this class, then x - v = q t for 
some i, so x = q t + v € V qi . If i ^ j, then the sets V qi and V qj are 
disjoint, as noted above. Q.E.D. 

17.6 Theorem. Vitali’s set V is not Lebesgue measurable. 

proof. Suppose, on the contrary, that the set V is measurable. 
We have two possibilities: (i) m(V) > 0, or (ii) m(V) = 0. 

Case (i). If m(V) > 0, then Theorem 17.3 implies that the 
difference set V © V contains an open ball with center 0. Therefore 
there exists a nonzero element x in this ball, all of whose coordinates 
are rational numbers. Since x belongs to this ball, there are elements 
vi,V 2 € V such that x = v\ —V 2 . But this means that v\ ~ v 2 , whence 
we conclude that v\ = x 2 and x = 0, a contradiction. Therefore, this 
case is not possible. 
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Case (ii). Suppose that m(V) = 0. Since Lebesgue measure is 
translation invariant, we have m{q@ V) = 0 for all q e Q p . It follows 
from the countable additivity of m and from (17.2) that 

+oo 

0 < m(R p ) = ^ m(<7j © V) = 0. 

i— 1 


Therefore, we have m{R p ) = 0, which implies that the -measure of 
every measurable subset in R p is equahto-0, which is a contradiction. 

Therefore, Vitali’s set is not Lebesgue measurable. Q.E.D. 

We now show that any set with positive outer measure contains 
a subset that is not Lebesgue measurable. (Thus, nonmeasurable sets 
are everywhere in R p \) 

17.7 Theorem. Any set E C R p with m*(E) > 0 contains a 
nonmeasurable subset. 

proof. By the preceding theorem, the translates V 9i = q t © V 
are not measurable; however, it is conceivable that their intersections 
Ei := E fl V,,. might be measurable. However, if E t is measurable 
for some i and has positive measure, then it follows from Theorem 
17.3 that the difference set E t © E t must contain a ball. But since 
Ei C V <u , it follows that the difference set V qi © V 9i = V © V must 
also contain a ball, which is contrary to the construction of V. We 
conclude that the sets Ei that are measurable must be null sets. It 
follows from (17.2) that 

+oo +oo 

E=[jEnV qi =\jEi. 

t=i j=i 

If all of the sets Ei are measurable, we have just seen that they must 
be null sets, so E is also a null set, contrary to hypothesis. Therefore, 
at least one of the sets E t is not Lebesgue measurable. Q.E.D. 

We now show that every measurable set with finite positive mea- 
sure has a nonadditive decomposition into the union of two sets that 
are nonmeasurable. 
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17.8 Theorem. Let E be a Lebesgue measurable set such that 
0 < m(E) < +oo. Then there exist nonmeasurable subsets B and C 
of E such that E = BllC, BnC = 0, and 

(17.3) m(E) <m*(B)+m*(C). 

PROOF. It follows from Theorem 17.7 that the set E has a non- 
measurable subset B, and we let C := E - B, so that E = BuC 
and B n C = 0. Moreover, since B = E - C, the set C must also be 
nonmeasurable. We conclude from the subadditivity of m* that 

(17-4) m*(E) < m*(B) + m*(C), 

However, if equality holds in (17.4), then it follows from Theorem 
16.3 that B is a measurable set, which is a contradiction. Therefore, 
we have (17.3). q.e.d. 

We will now see that every nonmeasurable set with finite outer 
measure is part of a nonadditive decomposition of a measurable set. 

17.9 Theorem. Let B be a nonmeasurable set such that 
m*(B) < +oo, and let H be a Gg-set set with B C H and m*(B ) = 
m(H). Then C := H - B is also nonmeasurable and 

m(H) = m(B U C) < m*(B) + m*{C). 

PROOF. The existence of H was established in Theorem 15.1(b). 
Since B — H— C, it follows that C must be nonmeasurable. Therefore 
m*(C) > 0, whence the strict inequality follows. q.e.d. 

We now show that there exists a nonmeasurable set that can be 
said to be “ubiquitous”, since every measurable subset of it (and of 
its complement) is a null set. 

17.10 Theorem. There exists a nonmeasurable set U C R p 
such that every Lebesgue measurable subset of U is a null set, and 
every measurable subset of its complement U c is also a null set. 
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PROOF. Let G and Go be defined by 

G := Q p © v/2 Z p and G 0 :=. Q p © (2v / 2)Z p , 
and let Gi be defined by 

G\ := (v/2,... , v/2) ®G 0 . 

It is easy to see that 

G = {(gi + niv/2, . . ., g p + n p v/2) : g, € Q,rn G Z}, 

Go = {(gi + 2n lV /2, . . ., g p + 2n p \/2) : g* € Q, n< e Z}, 

Gu — {(gi + (2»i + 1)V5. • • .,g P + (2n p + : qi £ Q,rii € Z}. 

We note that G and Go are subgroups of R p under addition, and 
that G, Go and Gi are dense in R p . It is also clear that 

G = Go © G\ and Go n Gi = 0. 

For each pair x,y € R p , we define x & y to mean that x — y e G, 
so that « is an equivalence relation on R p . We use the Axiom of 
Choice to obtain a set £ C R p containing a single element from each 
equivalence class in R p , so that if g ^ g', then (g ® £) n (g' © £) = 0. 
Consequently, we have the disjoint decomposition 

R p =\J(g®£). 

g€G 

We now define our desired set by 
(17.5) W:=G 0 ©£= 

g£G 0 

We note that since Go n Gi = 0, it follows that the complement U c 
is given by 

U c = G t © £ = (J (g © £). 

g£G x 
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Let F C U be measurable; for the purpose of obtaining a contradic- 
tion, we assume that m(F) > 0. If so, then by Theorem 17.3 there is 
a ball contained in F © F C U 0 U. But, since G\ is dense in R p , it 
must meet this ball and hence G i meets U QU. In order for this to 
be true there must exist an element gi € G\ of the form go + e\-e 2 , 
where go e Go and ei,e 2 € £. Therefore ei — e 2 = g\ — go e G, 
which implies that e\ ~ e 2 and hence e\ = e 2 and g\ = go. Since 
G± n Go = 0, we have obtained a contradiction. Therefore, we con- 
clude that any measurable subset of U is a null set. 

We have already noted that U c — G\@ £. Consequently, if 
F\ C U c , then F\ has the form 

F 1 = (V2,...,V2)@F 

for some subset F of U. Moreover, F\ is measurable if and only 
if its translate F is measurable. But it follows from the preceding 
paragraph that the only measurable subsets of U are null sets. By 
the translation invariance, we infer that Fi is also a null set. 

We conclude by noting that U must be nonmeasurable. For, if 
it is measurable, then its complement U c = (y/2, . . . , \/2) ® U, being 
a translate, is also measurable. By what we have seen above, both 
U and its complement W must be null sets . Consequently, R p is a 
null set, a contradiction showing that U is nonmeasurable. q.e.d. 

THE EXISTENCE OF NON-BOREL SETS 

Let F be the Cantor set, obtained by deleting “middle thirds” 
from the unit interval I := [0, 1], If x € F, then x has a unique 
ternary (that is, base 3) expansion represented by 

x = (O.CiC 2 C 3 . . ,) 3 , 

where c* = 0 or 2 for all k € N. We define a mapping <p : F — * I by 

¥>(*) :=(0.(c 1 /2)(c 2 /2)(c 3 /2)...) 2 , 

using the binary (that is, base 2) expansion of the number. Clearly, 
if x',x" e F and x' < x", then there exists k e IV such that all of 
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the digits in the ternary expansion of x' are equal to those in the 
expansion of x" up to the kth digit, but the fcth digit in the ternary 
expansion of x' is 0 and the kth digit in the ternary expansion of 
x" is 2. Hence it follows that p{x') < p(x ") so that p is a mono- 
tone nondecreasing map of F into I. Note, however, that p is 
not one-one; for example, if x' = (0.0220) 3 ^a; ,/ = (0.0202) 3 , then 
<p{x') = (0.0110) 2 = (0.010I) 2 = p(x"). In fact, p(x’) = p{x") for 
x' < x" if and only if these points have the form 

x' — (0.cic 2 . . . c*,02) 3 , x" = (0.cic 2 . . . Ck 20) 3 , 

and this holds if and only if x' and x" are left and right end points 
of a middle third interval 


(0.cjc 2 . . . c*,10) 3 < x < (0.cic 2 . . . c k 20) 3 , 


removed in the process of constructing F. It is also clear that p 
maps F onto I, since if y = (0.6i& 2 ■ • O2 is the binary expansion of 
an arbitrary number in I, then y is the image under p of the number 
x = (0.(26i)(26 2 ) . . ,) 3 in F. 

We now extend p to be defined on all of I by defining it to be 
constant on each of the middle third sets that are removed from I in 
the construction of F. We have noted that ip takes the same value 
at both end points of such a middle third set. In particular, 

p{x) = (0.0i) 2 - (0.10) 2 = i 

for all x satisfying (0.02) 3 = 3 < x < § = (0.20) 3 . Also, 

p{x) = \ for £<«<!; <p(x) = f for | < x < |; .... 

This extended function, which we will continue to denote by the letter 
p, is evidently a monotone nondecreasing function mapping I onto 
I and does not have any jump discontinuities, since every value of 
I is taken on at least once. Therefore the extended function p is 
continuous at every point of I. We also note that the derivative 
p'(x) = 0 for all points x e I — F, since p is constant on some 
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neighborhood of such a point. This extended <p is often called the 
Lebesgue singular function. Since the Lebesgue measure of the 
set F is 0, we see that the derivative of p exists almost everywhere 
and is equal to 0. However, p is far from being a constant function, 
since it maps I onto the set I. 

We now define ip on I to the closed interval [0, 2] by 

i ){x ) := x + p{x). 

It is clear that ip is a monotone nondecreasing function, since it is the 
sum of two such functions. In fact, ip is strictly increasing on /, so 
that ip is a one-one map of I onto the closed interval [0, 2]. Moreover, 
since ip is the sum of two continuous functions, it is continuous on 

I. It follows that the inverse function ip~ l , which maps [0,2] onto 

J, is also continuous. Therefore, ip is a homeomorphism (that is, a 
continuous function whose inverse is also continuous) of I and [0,2], 
and ip has the property that if B is a Borel set on JR, then both i p(B) 
and ip~ 1 (B) are Borel sets. 

Since p is constant on each of the middle third sets in I — F, we 
see that ip maps such a middle third set into an interval of the same 
length. Consequently, 

m(ip(I - F)) = m(I - F) = 1 , 

and since m([0, 2]) = 2, it follows from the fact that [0, 2] = ip(F) U 
ip(I — F) and ip(F) fl ip{I — F) = 0, that we have 

2 = m(ip{F)) + m(ip(I — F)). 

Consequently, we must have m(ip(F)) = 1. We conclude that the 
homeomorphism ip maps the set F, which has Lebesgue measure 0, 
to a set with Lebesgue measure equal to 1. 

Since ip(F ) has positive measure, we know from Theorem 17.7 
that it contains a set W that is not Lebesgue measurable. Then the 
set W\ := ip~~ l (W) is a subset of F and hence is a Lebesgue null 
set; consequently, Wi is a Lebesgue measurable set. However, 
cannot be a Borel set, since if it were, then W = ip(W\) would also 
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be a Borel set, and hence a Lebesgue measurable set. But this is 
contrary to the choice of W as a nonmeasurable set. 

We state this result formally. 

17.11 Theorem. There exist Lebesgue measurable subsets of 
R that are not Borel sets. 

We observe that we have also given an example of a Lebesgue 
measurable set W whose image under a strictly monotone homeo- 
morphism (and hence Borel measurable function) if is not Lebesgue 
measurable. Thus, although a homeomorphism always maps Borel 
sets to Borel sets, it may map a Lebesgue measurable set to a non- 
measurable set. 
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